(54) METHOD FOR CONTROLLING CLEAVAGE BY OmpT PROTEASE

(57) By clarifying the properties of OmpT protease,
a novel use thereof as a protease and a novel method of
producing a target polypeptide are provided. More particularly,
the present invention relates to a method of
controlling cleavage of a polypeptide by OmpT protease
which comprises converting a sequence site consisting
of two arbitrary consecutive amino acids and/or amino
acid(s) in the vicinity of said site in said polypeptide into
other amino acids, characterized by (1) setting lysine or
arginine as the amino acid at the -1-position concerning
said site and setting a specific amino acid as the amino
acid at the +1-position; and/or (2) setting specific amino
acid(s) as the amino acid(s) at the -4-position and/or the
-6-position relative to said site; so that desired parts of
said polypeptide are cleaved by OmpT protease and/or
undesired parts of said polypeptide are not cleaved by
OmpT protease. For example, the present invention provides
a method of treating by OmpT protease a fusion
protein having an amino acid sequence wherein the
amino acid at the +1-position is X (wherein X is an
amino acid other than glutamic acid, aspartic acid or
proline); and a method of controlling cleavage by converting
the amino acids at the -6-position and the -4-position
concerning the cleavage site into amino acids
other than acidic amino acids.
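
The example conditions named in the abstract can be written as a simple check on the residues around a chosen site. The following is only an illustrative sketch: the function names, the one-letter amino acid codes and the test sequences are assumptions made here and do not appear in the specification. The check reports which conditions hold and does not predict whether cleavage actually occurs.

```python
# Illustrative sketch only; names and sequences are hypothetical.
ACIDIC = {"D", "E"}  # aspartic acid, glutamic acid


def residue_at(seq, minus1_index, pos):
    """Return the residue at relative position pos (e.g. -6, -4, -1, +1).

    minus1_index is the 1-based index of the -1 residue. There is no
    position 0: the scissile bond lies between -1 and +1.
    """
    idx = minus1_index + pos + 1 if pos < 0 else minus1_index + pos
    return seq[idx - 1] if 1 <= idx <= len(seq) else None


def abstract_conditions(seq, minus1_index):
    """Report which of the abstract's example conditions hold at a site."""
    m1 = residue_at(seq, minus1_index, -1)
    p1 = residue_at(seq, minus1_index, +1)
    m4 = residue_at(seq, minus1_index, -4)
    m6 = residue_at(seq, minus1_index, -6)
    return {
        "minus1_is_K_or_R": m1 in {"K", "R"},
        "plus1_not_D_E_P": p1 is not None and p1 not in {"D", "E", "P"},
        "minus4_not_acidic": m4 is not None and m4 not in ACIDIC,
        "minus6_not_acidic": m6 is not None and m6 not in ACIDIC,
    }
```

For instance, `abstract_conditions("GSGSPSRHPRA", 10)` examines the bond between the arginine at index 10 and the following alanine.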
Description

FIELD OF THE INVENTION 

[0001] This invention relates to a method of controlling cleavage of a polypeptide by OmpT protease by applying
findings on novel cleavage and recognition sites which have been found by examining the substrate specificity of
Escherichia coli OmpT protease.

[0002] In one aspect, the present invention relates to a method of cleaving polypeptides by using OmpT protease.
More particularly, it relates to a method of cleaving polypeptides by utilizing novel cleavage and recognition sites of
OmpT protease.

[0003] In another aspect, the present invention relates to a method of excising physiologically active peptides,
proteins and derivatives thereof from fusion proteins with the use of OmpT protease. More particularly, it relates to a
method of efficiently producing physiologically active peptides, proteins and derivatives thereof from fusion proteins by
examining the substrate specificity of OmpT protease, thus finding a novel cleavage method and novel cleavage and
recognition sites, and then utilizing these properties.

[0004] The present invention further relates to a method of avoiding cleavage of polypeptides by OmpT protease at
undesired sites. In particular, it relates to a method of avoiding cleavage of physiologically active peptides, proteins or
derivatives thereof by OmpT protease when they are produced by host cells. That is to say, the present invention
provides a method of making the physiologically active peptides, proteins or derivatives thereof not (or hardly) cleavable
by OmpT protease by converting the amino acid sequences at the OmpT protease cleavage sites or in the vicinity
thereof.

PRIOR ART 

[0005] E. coli OmpT protease is a protease which exists in the E. coli outer membrane fraction and selectively cleaves
mainly bonds between basic amino acid pairs (Sugimura, K. and Nishihara, T. J. Bacteriol. 170: 5625-5632, 1988). This
enzyme has a molecular weight of 36,000 and seemingly falls within the category of serine proteases. Sugimura et al.
examined the substrate specificity of this OmpT protease and reported that the enzyme specifically cleaves the peptide
bonds at the center of the basic amino acid pairs arginine-arginine, lysine-lysine, arginine-lysine and lysine-arginine. In
addition, cleavage sites in amino acid sequences other than the above pairs have been found. Namely, it has been
reported that cleavage by OmpT protease arises at arginine-methionine (Zhao, G-R. and Somerville, R. L. J. Biol.
Chem. 268, 14912-14920, 1993), arginine-alanine (Lassen, S. F. et al. Biochem. Int. 27: 601-611, 1992) and arginine-valine
(Maurer, J. J. Bacteriol. 179: 794-804, 1997). This protease is characterized in that it does not cleave all proteins
and peptides containing these sequences but cleaves only specific proteins and peptides at specific sites. For example,
γ-interferon contains 10 sequences as described above but only 2 of them are cleavable by OmpT
protease (Sugimura, K. and Higashi, N. J. Bacteriol. 170: 3650-3654, 1988). T7 RNA polymerase contains 17
sequences as described above but only 2 of them are cleavable therewith (Grodberg, J. and
Dunn, J. J. J. Bacteriol. 170: 1245-1253, 1988). These facts indicate that the above-described data on the cleavage sites
of OmpT protease cannot be used to estimate its cleavage sites, unlike the case of AP-1 and trypsin,
which are commonly employed in peptide mapping of proteins and the cleavage sites of which can be estimated on the
basis of known data. Since the cleavage by OmpT protease arises at specific sites of proteins or peptides, it is anticipated
that amino acid sequences other than the amino acid sequences described above (namely, the N-terminal and
C-terminal amino acid sequences of the cleavage site) may participate in the cleavage. However, it still remains
unknown what amino acid sequence allows (or does not allow) the cleavage.
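
The gap between reported dipeptide sites and actual cleavage can be made concrete with a naive scan. The sketch below merely lists every occurrence of the seven dipeptides reported in the literature cited above; the function name and the one-letter codes are assumptions for illustration. As the paragraph stresses, such hits over-predict: most of them are not cleaved in practice.

```python
# Dipeptides reported in the cited literature as OmpT cleavage sites
# (written as one-letter codes; -1 residue first, +1 residue second).
REPORTED_PAIRS = {"RR", "KK", "RK", "KR", "RM", "RA", "RV"}


def candidate_sites(seq):
    """List (1-based index of the -1 residue, dipeptide) for every hit.

    This is only a pattern scan. It does not predict real cleavage,
    which depends on more than the two residues at the site.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in REPORTED_PAIRS:
            hits.append((i + 1, pair))
    return hits
```

Applied to a full protein sequence, the length of the returned list corresponds to the kind of count quoted above (10 candidate sequences in γ-interferon, 17 in T7 RNA polymerase), of which only 2 are cleaved in each case.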

[0006] However, OmpT protease has found use in excising target polypeptides from fusion proteins constructed by
gene recombination techniques, since it has high specificity to cleavage sites and is one of the endogenous proteases of
E. coli. To release and produce cholesterol esterase by using E. coli, Hanke et al. fused the esterase
with E. coli hemolysin A protein, had the fusion protein released out of the cells, and then treated the protein
with OmpT protease on the outer membrane, thereby successfully obtaining active cholesterol esterase from the fusion
protein. Hanke et al. employed a linker having an arginine-lysine sequence and cleaved this sequence by OmpT protease
(Hanke, C. et al. Mol. Gen. Genet. 233: 42-48, 1992).

[0007] The present inventors found that OmpT protease is resistant to denaturing agents and they clarified that
fusion proteins expressed as inclusion bodies can be cleaved in the presence of a denaturing agent by taking advantage
of the above property. Namely, the present inventors successfully produced a V8 protease derivative having the enzymatic
activity by expressing an S. aureus V8 protease derivative fusion protein as inclusion bodies in an E. coli expression
system, solubilizing the same with urea, then releasing the V8 protease derivative moiety from the fusion protein by using
OmpT protease in the presence of urea and finally refolding (Yabuta, M., Ochi, N. and Ohsuye, K. Appl. Microbiol.
Biotechnol. 44: 118-125, 1995).
[0008] To release target peptides or target proteins from fusion proteins, it has been a practice to employ enzymes
having high specificity to amino acid sequences. Known examples of proteases employed for this purpose include factor
Xa, thrombin, enterokinase and the like, which are enzymes originating in mammals and supplied only in small
amounts at a high cost. Therefore, these enzymes are unsuitable for the industrial treatment of peptides and proteins by
the fusion protein method on a mass scale. When the target peptide or protein is to be used as a medicine, it is also
necessary to take into consideration viral contamination originating in the enzymes. In contrast thereto, OmpT protease
is clearly superior to these enzymes in supply, cost and safety because it originates in E. coli.
[0009] However, the substrate specificity of this protease has not been sufficiently studied yet and, therefore, it is
difficult at the present stage to arbitrarily design the cleavage site at the desired part to be excised. Moreover, OmpT
protease does not cleave all of the sequences reported above (i.e., arginine-arginine, lysine-lysine, arginine-lysine,
lysine-arginine, arginine-methionine, arginine-alanine and arginine-valine) but only specific sites in proteins. Therefore,
when one of these two-amino-acid sequences is merely located in a linker site of a fusion protein, this site
cannot always be cleaved by OmpT protease. Even when it can be cleaved, OmpT protease cleaves the peptide bond
at the center of the cleavage site consisting of two amino acids. Therefore, the amino acid located at the +1-position of
the cleavage site will be added to the N-terminus of the target polypeptide when this enzyme cleaves a fusion protein
comprising a protective peptide, the amino acid sequence of the cleavage site of OmpT protease and the target
polypeptide in this order. Moreover, this added amino acid cannot be arbitrarily selected but is restricted to arginine,
lysine, valine, alanine or methionine on the basis of the recognition sequences of OmpT protease reported hitherto.
These properties make OmpT protease unfavorable as a protease to be used in cleaving fusion proteins.
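
The consequence described above, namely that the +1 residue of the site stays attached to the target polypeptide, can be illustrated with a short sketch. The function name and all sequences below are hypothetical placeholders chosen for illustration; they are not sequences from this specification.

```python
def cleave(fusion, minus1_index):
    """Split a fusion protein at the bond after the -1 residue.

    minus1_index is the 1-based index of the -1 residue. The second
    fragment starts with the +1 residue of the cleavage site.
    """
    if not 1 <= minus1_index < len(fusion):
        raise ValueError("the site must lie inside the sequence")
    return fusion[:minus1_index], fusion[minus1_index:]


# Hypothetical fusion: protective peptide + cleavage site + target.
protective = "MTMITDSLA"   # placeholder protective peptide
site = "RR"                # arginine-arginine cleavage site
target = "HAEGT"           # placeholder target polypeptide
fusion = protective + site + target

released_protective, released_target = cleave(fusion, len(protective) + 1)
# released_target carries the +1 arginine at its N-terminus.
```

With the previously reported site sequences, the extra N-terminal residue can only be arginine, lysine, valine, alanine or methionine, which is the restriction the paragraph describes.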

[0010] On the other hand, it is known that the cleavage efficiency of papain, which is a protease, is affected not only
by the sequence of the cleavage site in the substrate but also by the amino acid sequences in the vicinity thereof
(Schechter, I. and Berger, A. Biochem. Biophys. Res. Commun. 27: 157-162, 1967). Recently, detailed studies have also
been made on Kex2 (Rockwell, N. C., Wang, G. T., Krafft, G. A. and Fuller, R. S. Biochemistry 36, 1912-1917, 1997) and furin
(Krysan, D. J., Rockwell, N. C. and Fuller, R. S. J. Biol. Chem. 274, 23229-23234, 1999), which are proteases cleaving
the C-terminal side of basic amino acid pairs. In Kex2 and furin, the consensus sequences at the cleavage sites and
amino acid sequences in the vicinity thereof have been clarified by comparing the amino acid sequences of the substrates.
In the case of OmpT protease, it is considered, on the basis of the comparison of the substrates known hitherto,
that arginine or lysine is essentially required as the amino acid at the -1-position in the N-terminal side of the cleavage
site, but no other clear consensus sequence has been found so far. Although it is presumed that the recognition of
the cleavage site and the cleavage efficiency of OmpT protease might be affected not only by the cleavage site in
the substrate but also by the amino acid sequences in the vicinity thereof, it is impossible at the present stage to control
the cleavage by OmpT protease by using these properties.

[0011] In the present invention, the location of each amino acid in polypeptides is represented as follows. A
sequence site consisting of two arbitrary consecutive amino acids in the polypeptide is referred to as the cleavage site
or the site to be cleaved by OmpT protease. Between the amino acids concerning this site, the amino acid in the N-terminal
side is referred to as the -1-position while the amino acid in the C-terminal side is referred to as the +1-position.
Then the amino acids at the 1st, 2nd, 3rd, and so on in the N-terminal side of this site are referred to respectively as the
amino acids at the -1-, -2-, -3-positions and so on, while the amino acids at the 1st, 2nd, 3rd, and so on in the C-terminal
side of this site are referred to respectively as the amino acids at the +1-, +2-, +3-positions and so on. When amino acid
substitution is introduced into this site or in the vicinity thereof so that the site becomes not cleavable or cleavable, the
corresponding amino acids in the sequence are represented by the above-described numbering.
[0012] When an amino acid sequence leucine-tyrosine-lysine-arginine-histidine is to be cleaved at the bond
between lysine and arginine (i.e., the two arbitrary consecutive amino acids), for example, leucine, tyrosine, lysine,
arginine and histidine serve respectively as the amino acids at the -3-, -2-, -1-, +1- and +2-positions.
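
The numbering convention of paragraphs [0011] and [0012] can be reproduced with a small helper. This is a minimal sketch: the function name and the use of one-letter codes (L, Y, K, R, H for leucine, tyrosine, lysine, arginine, histidine) are choices made here for illustration.

```python
def label_positions(peptide, minus1_index):
    """Map relative positions (..., -2, -1, +1, +2, ...) to residues.

    minus1_index is the 1-based index of the residue on the N-terminal
    side of the bond to be cleaved. There is no position 0.
    """
    labels = {}
    for i, residue in enumerate(peptide, start=1):
        offset = i - minus1_index          # 0 for the -1 residue
        position = offset - 1 if offset <= 0 else offset
        labels[position] = residue
    return labels


# Leu-Tyr-Lys-Arg-His cleaved between Lys (index 3) and Arg (index 4).
positions = label_positions("LYKRH", 3)
```

Here `positions` assigns L, Y, K, R and H to -3, -2, -1, +1 and +2 respectively, matching the example in the text.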


SUMMARY OF THE INVENTION 

[0013] As described above, OmpT protease is highly useful. When OmpT protease is used as an enzyme for cleaving
fusion proteins, however, some problems arise at the present stage: it is unknown how to design the
amino acid sequence at the cleavage site to enable specific cleavage; the target peptides obtainable by the cleavage
are limited because of the restriction on the N-terminal amino acids of the target peptides; and cleavage cannot be
performed efficiently. However, it is expected that these problems can be solved by further studying the amino acid
sequences at the cleavage site and in the vicinity thereof and establishing a novel cleavage method or novel
recognition/cleavage sequences, thereby making OmpT protease more useful in cleaving fusion proteins.

[0014] Conversely, cleavage by OmpT protease sometimes becomes a problem in the production of
peptides or proteins by using E. coli. The cleavage by OmpT protease may be avoided by, for example, using an OmpT
protease-deficient E. coli strain as a host or by adding an OmpT protease inhibitor in the steps of incubation and
purification. However, these methods have not been generally employed because such a mutant strain as employed in
the former method is inadequate as a host in some cases, and the addition of enzyme inhibitors in the latter method
increases the production cost and raises the fear that the inhibitors might remain in the product. In these cases,
moreover, it is impossible to use OmpT protease as an enzyme for cleaving fusion proteins.
[0015] If the cleavage could be avoided by converting the amino acid sequence at the site of undesired cleavage by
OmpT protease or in the vicinity thereof, OmpT protease might be usable as a cleavage enzyme. Therefore, it is
expected that cleavage can be efficiently avoided, while minimizing the conversion of the amino acid sequence, by
clarifying, if possible, the characteristics of the recognition/cleavage sequences of OmpT protease.

BRIEF DESCRIPTION OF THE DRAWINGS 


[0016] 

Fig. 1 is a diagrammatic illustration of the construction of pG117S4HR6GLP-1, wherein β-gal117S4H represents a
region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
PSRHKR represents a region encoding an amino acid sequence PSRHKR (SEQ ID NO:60); GLP-1[G] represents
a region encoding human glucagon-like peptide-1; R6 represents a region encoding an amino acid sequence
QMHGYDAELRLYRRHHRWGRSGS (SEQ ID NO:61); Tc^r represents a tetracycline-resistance gene; and lac PO
represents E. coli lactose promoter operator gene. With respect to pG117S4HGP, see Japanese Laid-Open Patent
Publication No. 9-296000 and EP 794255.

Fig. 2 is a diagrammatic illustration of the construction of pG117S4HompRHKR, wherein β-gal117S4H represents
a region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
PSRHKR represents a region encoding an amino acid sequence PSRHKR; GLP-1[G] represents a region encoding
human glucagon-like peptide-1; R6 represents a region encoding an amino acid sequence
QMHGYDAELRLYRRHHRWGRSGS; Tc^r represents a tetracycline-resistance gene; L1 represents an amino acid sequence
QMHGYDAELRLYRRHHGSGS (SEQ ID NO:62); and lac PO represents E. coli lactose promoter operator gene.

Fig. 3 is a diagrammatic illustration of the construction of pG117S4HompRHPR, wherein β-gal117S4H represents
a region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
PSRHKR represents a region encoding an amino acid sequence PSRHKR; GLP-1[G] represents a region encoding
human glucagon-like peptide-1; R6 represents a region encoding an amino acid sequence
QMHGYDAELRLYRRHHRWGRSGS; Tc^r represents a tetracycline-resistance gene; L1 represents an amino acid sequence
QMHGYDAELRLYRRHHGSGS; PSRHPR represents a region encoding an amino acid sequence PSRHPR (SEQ
ID NO:63); Linker peptide represents a region encoding an amino acid sequence
QMHGYDAELRLYRRHHGSGSPSRHPR (SEQ ID NO:64) wherein L1 is a synthetic DNA ligated to PSRHPR; and lac PO
represents E. coli lactose promoter operator gene.

Fig. 4 shows the whole amino acid sequence of a fusion protein PR encoded by pG117S4HompRHPR, wherein
the underlined part corresponds to the amino acid sequence of human glucagon-like peptide-1 (GLP-1[G]); the
double-underlined part corresponds to arginine having been converted into another amino acid; and the arrow
shows the cleavage site by OmpT protease. The numerical symbols show the amino acid numbers counting from
the N-terminus. The protective protein (β-gal117S4H) derived from the N-terminal 117 amino acids of E. coli
β-galactosidase comprises the amino acid sequence from methionine at the 1-position to arginine at the 127-position.
The linker peptide comprises the amino acid sequence from glutamine at the 128-position to arginine at the
153-position. Pre-GLP-1[G] comprises an amino acid sequence from arginine at the +1-position (concerning the
cleavage site by OmpT protease) to glycine at the 184-position.

Fig. 5 is a diagrammatic illustration of the construction of pG117ompPRX, wherein β-gal117S4H represents a
region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r represents a tetracycline-resistance gene;
Linker peptide represents a region encoding an amino acid sequence from glutamine at the 128-position to arginine
at the 153-position; and lac PO represents E. coli lactose promoter operator gene.

Fig. 6 is a diagrammatic illustration showing the structure of a fusion protein PRX encoded by pG117ompPRX,
wherein numerical symbols show the amino acid numbers counting from the N-terminus of the fusion protein PRX.
β-gal117S4H represents a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
GLP-1[G] represents a human glucagon-like peptide-1; Pre GLP-1[G] represents a target peptide comprising
an amino acid sequence from the 141- to 184-positions containing GLP-1[G]; and Linker peptide represents an
amino acid sequence from glutamine at the 128-position to arginine at the 153-position. A site (arginine 140-X141)
corresponding to the OmpT protease cleavage site in the fusion protein PR is shown in this figure.

Fig. 7 is a diagrammatic illustration of the construction of pOmpTTc, wherein β-gal117S4H represents a region
encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase; GLP-1[G]
represents a region encoding human glucagon-like peptide-1; Tc^r represents a tetracycline-resistance gene; Ap^r
represents an ampicillin-resistance gene; Linker peptide represents a region encoding an amino acid sequence
QMHGYDAELRLYRRHHGSGSPSRHPR; OmpT represents OmpT protease gene; lac PO represents E. coli lactose
promoter operator gene; and trpP represents E. coli tryptophan promoter gene.

The nucleotide sequence from the transcription initiation site to the codon of the fifth amino acid following the
OmpT protease translation initiation is 

5 " AATTGTGAGCGGATAAOUITTTCACACAGGA^^ * 
(SEQ ID NO:65)



wherein the underlined part corresponds to the EcoRI recognition site.

With respect to pGP501, see Sugimura, K. Biochem. Biophys. Res. Commun. 153: 753-759, 1988.
Fig. 8 is a diagrammatic illustration of the construction of pOmpTTcB, wherein OmpT represents OmpT protease
gene; Tc^r represents a tetracycline-resistance gene; and lac PO represents E. coli lactose promoter operator gene.

The nucleotide sequence from the transcription initiation site to the codon of the fifth amino acid following the 
OmpT protease translation initiation is 

5 ' AATTGTGAGCGGATJIUICAATTTCACACAGGAA^^ ' 
(SEQ ID NO:66) 



wherein the underlined part corresponds to the EcoRI recognition site.

Fig. 9 is a diagrammatic illustration of the construction of pOmpTTcC, wherein OmpT represents OmpT protease
gene; Tc^r represents a tetracycline-resistance gene; and lac PO represents E. coli lactose promoter operator gene.

The nucleotide sequence from the transcription initiation site to the codon of the fifth amino acid following the
OmpT protease translation initiation is 

5 ' AATTGTGAGCGGATAAAAATTACAGACAGGAAfi&mS^TGCGGGCGAAACT^ ' 
(SEQ ID NO:67)

wherein the underlined part corresponds to the EcoRI recognition site. 

Fig. 10 is a diagrammatic illustration of the construction of pOmpTTcE, wherein OmpT represents OmpT protease
gene; Tc^r represents a tetracycline-resistance gene; and lac PO represents E. coli lactose promoter operator gene.

The nucleotide sequence from the transcription initiation site to the codon of the fifth amino acid following the
OmpT protease translation initiation is 

5 ' AATTGTGAGCGG&TAAAAATTACAGillCAGGAAfi&mi:^^ * 
(SEQ ID NO: 68) 



wherein the underlined part corresponds to the EcoRI recognition site.

Fig. 11 is an SDS-PAGE (16%) photograph relating to the cleavage of the fusion protein PRX by OmpT protease. 

In this figure, Mr represents molecular weight marker proteins; O represents the lane for purified OmpT protease;
- represents a lane free from OmpT protease; and + represents a lane with the addition of OmpT protease.
A : PRA, V : PRV, L : PRL, I : PRI, P : PRP, F : PRF, W : PRW, M : PRM, G : PRG, S : PRS,
T : PRT, C : PRC, Y : PRY, N : PRN, Q : PRQ, D : PRD, E : PRE, K : PRK, R : PRR, H : PRH.

The 4.9kDa peptide fragment means a peptide fragment containing GLP-1[G] which has been excised by
OmpT protease.

Fig. 12 is a diagrammatic illustration of the construction of pG117ompPKX, wherein β-gal117S4H represents a
region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r represents a tetracycline-resistance gene;
Linker peptide represents a region encoding an amino acid sequence from glutamine at the 128-position to arginine
at the 153-position; and lac PO represents E. coli lactose promoter operator gene.

Fig. 13 is a diagrammatic illustration showing the structure of a fusion protein PKX encoded by pG117ompPKX,
wherein numerical symbols show the amino acid numbers counting from the N-terminus of the fusion protein PKX.
β-gal117S4H represents a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
GLP-1[G] represents a human glucagon-like peptide-1; Pre GLP-1[G] represents a target peptide comprising
an amino acid sequence from the 141- to 184-positions containing GLP-1[G]; and Linker peptide represents the
amino acid sequence from glutamine at the 128-position to arginine at the 153-position. A site (lysine 140-X141)
corresponding to the OmpT protease cleavage site in the fusion protein PRR is shown in this figure.
Fig. 14 is an SDS-PAGE (16%) photograph relating to the cleavage of the fusion protein PKX by OmpT protease.
In this figure, Mr represents molecular weight marker proteins; O represents purified OmpT protease; - represents
a lane free from OmpT protease; and + represents a lane with the addition of OmpT protease.

KA, KS, KK, KR, KD and KE represent respectively PKA, PKS, PKK, PKR, PKD and PKE.

The 4.9kDa peptide fragment means a peptide fragment containing GLP-1[G] which has been excised by
OmpT protease.

Fig. 15 is a diagrammatic illustration of the construction of pG117ompPRhANP, wherein β-gal117S4H represents
a region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
GLP-1[G] represents a region encoding human glucagon-like peptide-1; α-hANP represents a region encoding
α-type human atrial natriuretic peptide; Tc^r represents a tetracycline-resistance gene; Linker peptide 1 represents a
region encoding an amino acid sequence QFK (SEQ ID NO:69); Linker peptide 2 represents a region encoding an
amino acid sequence QMHGYDAELRLYRRHHGSGSPYRHPR (SEQ ID NO:70); Linker peptide 3 represents a
region encoding an amino acid sequence QMHGYDAELRLYR (SEQ ID NO:71); and lac PO represents E. coli lactose
promoter operator gene. With respect to pGHα97SII, see "Daichokin o shukushu toshita seirikassei peputido
seisankei ni kansuru kenkyu (Study on Physiologically Active Peptide Production System with the Use of E. coli as
Host)", Koji Magota, Doctoral Dissertation, Kyushu University, 1991.

Fig. 16 is a diagrammatic illustration showing the structure of a fusion protein PRhANP encoded by
pG117ompPRhANP, wherein numerical symbols show the amino acid numbers counting from the N-terminus of
the fusion protein PRhANP. β-gal117S4H represents a protective protein derived from the N-terminal 117 amino
acids of E. coli β-galactosidase; α-hANP represents an α-human atrial natriuretic peptide; and Linker peptide
represents the amino acid sequence from glutamine at the 128-position to arginine at the 140-position. A site (arginine
140-serine 141) corresponding to the OmpT protease cleavage site in the fusion protein PRR is shown in this figure.

Fig. 17 is a diagrammatic illustration of the construction of pG117ompPRhCT, wherein β-gal117S4H represents a
region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
β-gal97S4D represents a region encoding a protective protein originating in 97 amino acids from the N-terminus of
E. coli β-galactosidase; GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r represents a
tetracycline-resistance gene; Linker peptide 1 represents a region encoding an amino acid sequence
EFRHHRRHRLE (SEQ ID NO:72); Linker peptide 2 represents a region encoding an amino acid sequence
QMHGYDAELRLYRRHHGSGSPYRHPR; Linker peptide 3 represents a region encoding an amino acid sequence
QMHGYDAELRLYR; and lac PO represents E. coli lactose promoter operator gene. With respect to
pG97S4DhCT[G]R4, see Yabuta, M., Suzuki, Y. and Ohsuye, K. Appl. Microbiol. Biotechnol. 42: 703-708, 1995.
Fig. 18 is a diagrammatic illustration showing the structure of the fusion protein PRhCT encoded by
pG117ompPRhCT, wherein numerical symbols show the amino acid numbers counting from the N-terminus of the
fusion protein PRhCT. β-gal117S4H represents a protective protein derived from the N-terminal 117 amino acids
of E. coli β-galactosidase; hCT[G] represents a human calcitonin precursor; and Linker peptide represents the
amino acid sequence from glutamine at the 128-position to arginine at the 140-position. A site (arginine 140-cysteine
141) corresponding to the OmpT protease cleavage site in the fusion protein PRR is shown in this figure.
Fig. 19 is an SDS-PAGE (16%) photograph relating to the cleavage of the fusion proteins PRhANP and PRhCT by
OmpT protease. In this figure, Mr represents molecular weight marker proteins; O represents purified OmpT protease;
- represents a lane free from OmpT protease; and + represents a lane with the addition of OmpT protease.

hANP and hCT represent respectively PRhANP and PRhCT.
Fig. 20 is a diagrammatic illustration of the construction of pGRShANP, wherein β-gal97S represents a region
encoding a protective protein derived from the N-terminal 97 amino acids of E. coli β-galactosidase; α-hANP
represents a region encoding α-type human atrial natriuretic peptide; Tc^r represents a tetracycline-resistance gene;
Linker peptide 1 represents a region encoding an amino acid sequence QFK; Linker peptide 2 represents a region
encoding an amino acid sequence QFR (SEQ ID NO:73); and lac PO represents E. coli lactose promoter operator
gene.






EP1076097A1 



Fig. 21 is a diagrammatic illustration showing the whole amino acid sequence of a fusion protein RShANP encoded
by pGRShANP, wherein the underlined part represents the amino acid sequence of α-hANP (α-type human atrial
natriuretic peptide); the double-underlined part represents serine having been converted into another amino acid;
and the arrow shows the cleavage site by OmpT protease. The numerical symbols show the amino acid numbers
counting from the N-terminus. The protective protein (β-gal97S) derived from the N-terminal 97 amino acids of
E. coli β-galactosidase comprises the amino acid sequence from methionine at the 1-position to alanine at the
98-position. The linker peptide comprises the amino acid sequence from glutamine at the 99-position to arginine at the
101-position.

Fig. 22 is a diagrammatic illustration of the construction of pGRXhANP, wherein β-gal97S represents a region
encoding a protective protein derived from the N-terminal 97 amino acids of E. coli β-galactosidase; Modified
α-hANP represents a region encoding an α-type human atrial natriuretic peptide derivative having substitution of the
N-terminal amino acid into arginine, alanine or cysteine; Tc^r represents a tetracycline-resistance gene; Linker
peptide represents a region encoding an amino acid sequence QFK; and lac PO represents E. coli lactose promoter
operator gene.

Fig. 23 is a diagrammatic illustration showing the structure of a fusion protein RXhANP, wherein numerical symbols
show the amino acid numbers counting from the N-terminus of the fusion protein PRhANP. β-gal97S represents a
protective protein derived from the N-terminal 97 amino acids of E. coli β-galactosidase; Modified α-hANP
represents an α-type human atrial natriuretic peptide derivative having a substitution of the N-terminal amino acid into
arginine, alanine or cysteine; and Linker peptide represents the amino acid sequence from glutamine at the
99-position to arginine at the 101-position. A site (arginine 101-X102) corresponding to the OmpT protease cleavage
site in the fusion protein RShANP is shown in this figure.

Fig. 24 is an SDS-PAGE (16%) photograph relating to the cleavage of the fusion proteins RShANP and RXhANP
by OmpT protease.

In this figure, Mr represents molecular weight marker proteins; - represents a lane free from OmpT protease;
and + represents a lane with the addition of OmpT protease. RS, RR, RA and RC respectively represent RShANP,
RRhANP, RAhANP and RChANP.

Fig. 25 is a diagrammatic illustration showing the structure of a fusion protein PRRXA encoded by
pG117ompPRRXA, wherein -10, -5, -1, +1 and +4 respectively show the -10-, -5-, -1-, +1- and +4-positions
concerning the OmpT protease cleavage site of the fusion protein PRR. The amino acid sequences (from the -10- to
+4-positions) of the fusion proteins PRR and PRRXA are given in the figure. β-gal117S4H represents a protective protein
originating in the N-terminal 117 amino acids of E. coli β-galactosidase; GLP-1[G] represents a human
glucagon-like peptide-1; and Linker peptide corresponds to the amino acid sequence from glutamine at the 128-position to
arginine at the 153-position in the amino acid sequence shown in Fig. 6. The OmpT protease cleavage site in the
fusion protein PRR is shown in this figure. The fusion proteins are represented in bold figures and the substituted
alanine is underlined.

Fig. 26 is a diagrammatic illustration of the construction of pG117ompPRR-2A, -3A and -4A, wherein β-gal117S4H
represents a region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli
β-galactosidase; GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r represents a
tetracycline-resistance gene; Linker peptide corresponds to the region encoding an amino acid sequence from glutamine at the
128-position to arginine at the 153-position in the amino acid sequence shown in Fig. 6; and lac PO represents
E. coli lactose promoter operator gene.

Fig. 27 is a diagrammatic illustration of the construction of pG117ompPRR-5A and -6A, wherein β-gal117S4H
represents a region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli
β-galactosidase; GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r represents a
tetracycline-resistance gene; Linker peptide corresponds to the region encoding an amino acid sequence from
glutamine at the 128-position to arginine at the 153-position in the amino acid sequence shown in Fig. 6; and
lac PO represents the E. coli lactose promoter operator gene.

Fig. 28 is a diagrammatic illustration of the construction of pG117ompPRR-8A, -9A and -10A, wherein
β-gal117S4H represents a region encoding a protective protein derived from the N-terminal 117 amino acids of
E. coli β-galactosidase; GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r represents a
tetracycline-resistance gene; Linker peptide corresponds to the region encoding the amino acid sequence from
glutamine at the 128-position to arginine at the 153-position in the amino acid sequence shown in Fig. 6; and
lac PO represents the E. coli lactose promoter operator gene.

Fig. 29 is a diagrammatic illustration of the construction of pG117ompPRR-1A, wherein β-gal117S4H represents a
region encoding a protective protein derived from the N-terminal 117 amino acids of E. coli β-galactosidase;
GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r represents a tetracycline-resistance
gene; Linker peptide corresponds to the region encoding the amino acid sequence from glutamine at the
128-position to arginine at the 153-position in the amino acid sequence shown in Fig. 6; and lac PO represents
the E. coli lactose promoter operator gene.

Fig. 30 is an SDS-PAGE (16%) photograph relating to the cleavage of the fusion proteins PRR and PRRXA by
OmpT protease. In this figure, Mr represents a protein molecular weight marker; O represents purified OmpT
protease; - represents a lane free from OmpT protease; and + represents a lane with the addition of OmpT protease.
-1A: PRR-1A, -2A: PRR-2A, -3A: PRR-3A, -4A: PRR-4A, -5A: PRR-5A,
-6A: PRR-6A, -8A: PRR-8A, -9A: PRR-9A, -10A: PRR-10A.

The 4.9 kDa peptide fragment means a peptide fragment containing GLP-1[G] which has been excised by
OmpT protease.

Fig. 31 is a diagrammatic illustration showing the structure of the fusion protein PRR-4X encoded by
pG117ompPRR-4X, wherein -10, -5, -1, +1 and +4 respectively show the -10-, -5-, -1-, +1- and +4-positions
concerning the OmpT protease cleavage site of the fusion protein PRR. The amino acid sequence (from the -10- to
+4-positions) of the fusion protein PRR and the substituted amino acid at the -4-position of the fusion protein
PRR-4X are given in the figure. β-gal117S4H represents a protective protein derived from the N-terminal 117
amino acids of E. coli β-galactosidase; GLP-1[G] represents a human glucagon-like peptide-1; and Linker peptide
corresponds to an amino acid sequence from glutamine at the 128-position to arginine at the 153-position in the
amino acid sequence shown in Fig. 6. The OmpT protease cleavage site in the fusion protein PRR is shown in this
figure. The fusion proteins are expressed in bold figures.

Fig. 32 is a diagrammatic illustration showing the structure of the fusion protein PRR-6X encoded by
pG117ompPRR-6X, wherein -10, -5, -1, +1 and +4 respectively show the -10-, -5-, -1-, +1- and +4-positions
concerning the OmpT protease cleavage site of the fusion protein PRR. The amino acid sequence (from the -10- to
+4-positions) of the fusion protein PRR and the substituted amino acid at the -6-position of the fusion protein
PRR-6X are given in the figure. β-gal117S4H represents a protective protein derived from the N-terminal 117
amino acids of E. coli β-galactosidase; GLP-1[G] represents a human glucagon-like peptide-1; and Linker peptide
corresponds to an amino acid sequence from glutamine at the 128-position to arginine at the 153-position in the
amino acid sequence shown in Fig. 6. The OmpT protease cleavage site in the fusion protein PRR is shown in this
figure. The fusion proteins are represented in bold figures.

Fig. 33 is a diagrammatic illustration of the construction of pG117ompPRR-4X (wherein X is K, D, E, N or Q). In
this figure, β-gal117S4H represents a region encoding a protective protein derived from the N-terminal 117 amino
acids of E. coli β-galactosidase; GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r
represents a tetracycline-resistance gene; Linker peptide corresponds to the region encoding the amino acid
sequence from glutamine at the 128-position to arginine at the 153-position in the amino acid sequence shown in
Fig. 6; and lac PO represents the E. coli lactose promoter operator gene.

Fig. 34 is a diagrammatic illustration of the construction of pG117ompPRR-6X (wherein X is K, D, E, N or Q). In
this figure, β-gal117S4H represents a region encoding a protective protein derived from the N-terminal 117 amino
acids of E. coli β-galactosidase; GLP-1[G] represents a region encoding human glucagon-like peptide-1; Tc^r
represents a tetracycline-resistance gene; Linker peptide corresponds to the region encoding the amino acid
sequence from glutamine at the 128-position to arginine at the 153-position in the amino acid sequence shown in
Fig. 6; and lac PO represents the E. coli lactose promoter operator gene.

In Fig. 35, A is an SDS-PAGE (16%) photograph relating to the cleavage of the fusion proteins PRR and PRR-4X
by OmpT protease. In this figure, Mr represents a protein molecular weight marker; - represents a lane free from
OmpT protease; and + represents a lane with the addition of OmpT protease.
-4K: PRR-4K, -4A: PRR-4A, -4N: PRR-4N, -4Q: PRR-4Q, -4D: PRR-4D, -4E: PRR-4E.

The 4.9 kDa peptide fragment means a peptide fragment containing GLP-1[G] which has been excised by
OmpT protease.

In Fig. 35, B is an SDS-PAGE (16%) photograph relating to the cleavage of the fusion proteins PRR and PRR-6X
by OmpT protease. In this figure, Mr represents a protein molecular weight marker; - represents a lane free from
OmpT protease; and + represents a lane with the addition of OmpT protease.
-6R: PRR-6R, -6K: PRR-6K, -6A: PRR-6A, -6N: PRR-6N, -6Q: PRR-6Q, -6D: PRR-6D.

The 4.9 kDa peptide fragment means a peptide fragment containing GLP-1[G] which has been excised by
OmpT protease.

Fig. 36 is a diagrammatic illustration showing the structure of the fusion protein RShANPR encoded by
pRShANPR, wherein -10, -5, -1, +1 and +4 respectively show the -10-, -5-, -1-, +1- and +4-positions concerning
the OmpT protease cleavage site of the fusion protein RShANP. The amino acid sequences (from the -10- to
+4-positions) of the fusion proteins RShANP and RShANPR are shown in the figure. β-gal97S4H represents a
protective protein derived from the N-terminal 97 amino acids of E. coli β-galactosidase; α-hANP represents an
α-type human atrial natriuretic peptide; and Linker peptide represents the amino acid sequence from glutamine at
the 99-position to arginine at the 101-position in the amino acid sequence shown in Fig. 23. The OmpT protease
cleavage site in the fusion protein RShANP is shown in this figure. The fusion proteins are expressed in bold
figures and the substituted arginines at the -6- and -4-positions are underlined.

Fig. 37 is a diagrammatic illustration of the construction of pGRShANPR, wherein β-gal97S4H represents a region
encoding a protective protein derived from the N-terminal 97 amino acids of E. coli β-galactosidase; α-hANP
represents a region encoding α-type human atrial natriuretic peptide; Tc^r represents a tetracycline-resistance
gene; Linker peptide represents a region encoding an amino acid sequence QFR (SEQ ID NO:73); and lac PO
represents the E. coli lactose promoter operator gene.

DETAILED DESCRIPTION OF THE INVENTION 

[0017] Under these circumstances, the present inventors have discovered new substrate specificity profiles by
examining the amino acid sequences at and around known cleavage sites, from the viewpoint that the amino acid
sequences around the cleavage sites are important in substrate recognition and cleavage by OmpT protease. To
apply these new substrate specificity profiles to the cleavage of fusion proteins, the inventors have conducted
intensive studies and consequently completed the present invention. More specifically, the method according to
the present invention makes use of two properties of OmpT protease: it acts highly specifically so as to cleave
exclusively an arginine-X bond or a lysine-X bond (wherein X is an amino acid other than glutamic acid, aspartic
acid or proline) existing in specific amino acid sequences including known cleavage sites, and its cleavage
efficiency is affected by the charges on the amino acids at the -6- and -4-positions of the cleavage site.
[0018] Accordingly, the present invention provides a method of controlling cleavage of a polypeptide by OmpT
protease using the properties as described above, which comprises converting the amino acid(s) of a sequence
site consisting of two arbitrary consecutive amino acids and/or amino acid(s) in the vicinity of said site in
said polypeptide into another or other amino acid(s), characterized by (1) setting lysine or arginine as the
amino acid at the -1-position of said site and setting a specific amino acid as the amino acid at the
+1-position; and/or (2) setting specific amino acid(s) as the amino acid(s) at the -4-position and/or the
-6-position from said site; so that a desired part of said polypeptide is cleavable by OmpT protease and/or an
undesired part of said polypeptide is not cleavable by OmpT protease.

[0019] In one aspect, therefore, the present invention provides a method of controlling a target polypeptide by
OmpT protease which comprises setting lysine or arginine as the amino acid at the -1-position of a cleavage site
in an amino acid sequence of a fusion protein containing the target polypeptide, which is cleavable by OmpT
protease, and setting an amino acid X (wherein X is an amino acid other than glutamic acid, aspartic acid or
proline) as the amino acid at the +1-position of the corresponding amino acid sequence (hereinafter referred to
as the corresponding sequence). In another aspect, the present invention provides a method of increasing the
cleavage efficiency in excising the target polypeptide by setting amino acid(s) other than acidic amino acid(s)
(preferably basic amino acid(s) and still preferably lysine or arginine) as the amino acid(s) at the -6- and/or
-4-positions of the cleavage site of the amino acid sequence cleavable by OmpT protease.
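
The -1/+1 rule stated above can be written down as a small check. The following Python sketch is illustrative
only: the function and constant names are hypothetical, one-letter amino acid codes are assumed, and it encodes
just the condition given in the text (lysine or arginine at the -1-position; any residue other than glutamic
acid, aspartic acid or proline at the +1-position). It does not model the surrounding sequence, which the text
says also influences cleavage, so a True result should be read as "not excluded by the rule" rather than as a
prediction of cleavage.

```python
# Hypothetical helper names; one-letter amino acid codes are assumed.
BASIC_MINUS1 = {"K", "R"}          # lysine, arginine
BLOCKING_PLUS1 = {"D", "E", "P"}   # aspartic acid, glutamic acid, proline


def satisfies_site_rule(minus1: str, plus1: str) -> bool:
    """Return True if the -1/+1 residue pair meets the rule stated in the text."""
    return minus1.upper() in BASIC_MINUS1 and plus1.upper() not in BLOCKING_PLUS1


if __name__ == "__main__":
    # Arg-Ser and Lys-Ala pass; Arg-Pro and Ala-Ser do not.
    for pair in ("RS", "KA", "RP", "AS"):
        print(pair, satisfies_site_rule(pair[0], pair[1]))
```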

[0020] When a fusion protein is expressed with the use of genetic engineering techniques and then cleaved by the
method according to the present invention, for example, the fusion protein is expressed in such a manner as to
contain "the corresponding sequence" at least as a part of the fusion protein and then the latter is treated by
OmpT protease. Thus, the target polypeptide can be released. According to the present invention, moreover, the
target polypeptide can be more efficiently excised by setting basic amino acids as the amino acids at the -6-
and -4-positions of the cleavage site. The term "target polypeptide" as used herein means any polypeptide to be
expressed as a fusion protein or a polypeptide obtained by secretory expression, direct expression, or the like.
In a case of fusion proteins to be cleaved by OmpT protease, for example, use may be made of polypeptides which
exert physiological activity immediately after the cleavage by OmpT protease or as a result of post-translation
modification. Alternatively, production intermediates from which physiologically active peptides are formed
after further cleavage following the above-described reaction (i.e., so-called precursor peptides) may be
employed therefor.

[0021] It is considered that the amino acid sequence to be cleaved by OmpT protease is defined as the sequence
of from about the -20- to +20-positions relative to the cleavage site. Accordingly, the "corresponding sequence"
as used herein may be selected from among the sequence ranging from about the -20- to +20-positions relative to
the cleavage site of amino acid sequences which have already been known or experimentally confirmed as cleavable
by OmpT protease. For example, the "corresponding sequence" may be selected from the -20- to -1-positions, from
the -20- to +1-positions, from the -1- to +20-positions or from the +1- to +20-positions.
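
As an illustration of the position numbering used here, the sketch below extracts a window around a cleavage
site from a one-letter sequence. It is a minimal sketch under stated assumptions: the function name and the
`cut` convention (the number of residues on the N-terminal side of the scissile bond, so that the -1-position
is `seq[cut - 1]` and the +1-position is `seq[cut]`) are hypothetical and are not taken from the specification.

```python
def corresponding_sequence(seq: str, cut: int, upstream: int = 20, downstream: int = 20):
    """Return (N-terminal side, C-terminal side) of the window around a cleavage site.

    `cut` is the number of residues preceding the scissile bond.
    `upstream` residues are taken from the -upstream- to -1-positions and
    `downstream` residues from the +1- to +downstream-positions; the window
    is truncated at the ends of the sequence.
    """
    if not 0 < cut < len(seq):
        raise ValueError("cut must lie strictly inside the sequence")
    if upstream < 0 or downstream < 0:
        raise ValueError("window sizes must be non-negative")
    start = max(0, cut - upstream)
    end = min(len(seq), cut + downstream)
    return seq[start:cut], seq[cut:end]
```

For example, `upstream=20, downstream=1` gives the -20- to +1-positions and `upstream=1, downstream=20` gives
the -1- to +20-positions, two of the ranges listed in the text.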

[0022] When an amino acid sequence cleavable by OmpT protease exists, the cleavage can be prevented in the
method of the present invention by converting the amino acid at the +1-position of the cleavage site into
glutamic acid, aspartic acid or proline. When such a conversion of the amino acid at the +1-position is
impossible, the cleavage efficiency can be lowered by converting one or both of the amino acids at the -6- and
-4-positions into acidic amino acid(s).
[0023] By combining the above methods, the cleavage by OmpT protease can be controlled.
[0024] This is particularly convenient in a case wherein a fusion protein containing a target polypeptide is
produced in E. coli employed as a host and then the target polypeptide is excised from the fusion protein with
the use of OmpT protease inherently possessed by E. coli.
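
The two ways of suppressing cleavage described in [0022] can be enumerated mechanically for a given site. The
Python sketch below is a hedged illustration, not a method taken from the specification: the function names, the
`cut` convention (number of residues on the N-terminal side of the scissile bond) and the returned layout are
all assumptions. It lists single substitutions only and makes no statement about how large the effect would be.

```python
ACIDIC = "DE"            # aspartic acid, glutamic acid
BLOCKING_PLUS1 = "DEP"   # residues the text says prevent cleavage at the +1-position


def index_of(cut: int, position: int) -> int:
    """Map a site-relative position (-6, -4, -1, +1, ...) to a 0-based index.

    Position -1 is seq[cut - 1] and position +1 is seq[cut]; there is no position 0.
    """
    if position == 0:
        raise ValueError("there is no position 0")
    return cut + position if position < 0 else cut + position - 1


def suppression_options(seq: str, cut: int) -> dict:
    """List single substitutions as (position, old_residue, new_residue) tuples."""
    if not 0 < cut < len(seq):
        raise ValueError("cut must lie strictly inside the sequence")
    plus1 = seq[index_of(cut, 1)]
    # Option 1: replace the +1 residue with Asp, Glu or Pro to prevent cleavage.
    prevent = [(1, plus1, aa) for aa in BLOCKING_PLUS1 if aa != plus1]
    # Option 2: place an acidic residue at -6 and/or -4 to lower the efficiency.
    lower = []
    for position in (-6, -4):
        i = index_of(cut, position)
        if i >= 0:  # skip positions that fall before the N-terminus
            lower += [(position, seq[i], aa) for aa in ACIDIC if aa != seq[i]]
    return {"prevent": prevent, "lower": lower}
```

Called on a made-up sequence such as `suppression_options("QYAAAARSLL", 7)`, the function treats R as the
-1-position and S as the +1-position and lists the candidate substitutions at +1, -6 and -4.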

[0025] Accordingly, the present invention relates to the following methods.

a) A method of controlling cleavage of a polypeptide by OmpT protease which comprises converting amino acid(s)
of a sequence site consisting of two arbitrary consecutive amino acids and/or amino acid(s) in the vicinity of
said site in said polypeptide into other amino acid(s), characterized by (1) setting lysine or arginine as the
amino acid at the -1-position relative to said site and setting a specific amino acid as the amino acid at the
+1-position; and/or (2) setting specific amino acid(s) as the amino acid(s) at the -4-position and/or the
-6-position relative to said site; so that a desired part of said polypeptide is cleaved by OmpT protease and/or
an undesired part of said polypeptide is not cleaved by OmpT protease.

b) The method as described in the above a) of controlling cleavage of polypeptides by OmpT protease which
comprises converting amino acid(s) of a sequence site consisting of two arbitrary consecutive amino acids and/or
amino acid(s) in the vicinity of said site in said polypeptide into other amino acids, characterized by (1)
setting lysine or arginine as the amino acid at the -1-position relative to said site and setting a specific
amino acid as the amino acid at the +1-position; and/or (2) setting specific amino acid(s) as the amino acid(s)
at the -4-position and/or the -6-position relative to said site; so that a desired part of said polypeptide is
cleaved by OmpT protease.

c) The method as described in the above b) characterized by (1) setting an amino acid other than glutamic acid,
aspartic acid or proline as the amino acid at the +1-position; and/or (2) setting amino acid(s) (preferably
basic amino acids and still preferably lysine or arginine) other than acidic amino acids as the amino acid(s) at
the -4-position and/or the -6-position relative to said site.

d) A method as described in any of the above a) to c) characterized by, in a case where the amino acid at the
-1-position of the sequence site consisting of two arbitrary consecutive amino acids in said polypeptide is
neither lysine nor arginine, converting said amino acid into lysine or arginine and setting an amino acid X
(wherein X is an amino acid other than glutamic acid, aspartic acid, proline, arginine, lysine, alanine,
methionine or valine) as the amino acid at the +1-position so that a desired part of said polypeptide is cleaved
by OmpT protease.

e) The method as described in the above a) for controlling cleavage of polypeptides by OmpT protease which
comprises converting amino acid(s) of a sequence site consisting of two arbitrary consecutive amino acids and/or
amino acid(s) in the vicinity of said site in said polypeptide into other amino acids, characterized by (1)
setting lysine or arginine as the amino acid at the -1-position relative to said site and setting a specific
amino acid as the amino acid at the +1-position; and/or (2) setting specific amino acid(s) as the amino acid(s)
at the -4-position and/or the -6-position relative to said site; so that an undesired part of said polypeptide
is not cleaved by OmpT protease.

f) The method as described in the above e) characterized by (1) setting glutamic acid, aspartic acid or proline
as the amino acid at the +1-position; and/or (2) setting acidic amino acid(s) as the amino acid(s) at the
-4-position and/or the -6-position.

g) A method of applying the method as described in the above e) or f) to a case wherein a gene encoding a
polypeptide is expressed in host cells and said polypeptide is otherwise cleaved by OmpT protease at an
undesired part.

h) A method of producing a polypeptide by expressing a gene encoding said polypeptide in host cells,
characterized by converting amino acid(s) as described in the above a) to g) in a case wherein said polypeptide
is otherwise cleaved by OmpT protease at an undesired part.

i) A method as described in any of the above a) to f) which comprises expressing in host cells a gene encoding a 
fusion protein consisting of a target polypeptide fused with a protective peptide via a cleavage site (optionally 
located in a linker peptide) and being cleavable by OmpT protease at said cleavage site, and cleaving off the protein 
at said cleavage site by OmpT protease to thereby obtain the target polypeptide from the fusion protein. 

j) The method as described in the above i) wherein an amino acid sequence cleavable by OmpT protease exists in
the amino acid sequences of the protective peptide, the linker peptide and/or the target polypeptide constituting
said fusion protein.

k) A method of producing a target polypeptide which comprises expressing in host cells a gene, which encodes a
fusion protein consisting of a target polypeptide fused with a protective peptide via a cleavage site (optionally
located in a linker peptide) and being cleavable by OmpT protease at said cleavage site, and cleaving off the
protein at said cleavage site by OmpT protease to thereby obtain the target polypeptide from said fusion protein,
characterized by using a method as described in any of the above a) to f) in converting the amino acids at the
cleavage site and/or in the vicinity thereof.

l) The method as described in the above k) wherein an amino acid sequence cleavable by OmpT protease exists
in the amino acid sequences of the protective peptide, the linker peptide and/or the target polypeptide
constituting said fusion protein.

m) A method as described in any of the above g) to l) wherein the host cell is E. coli.

n) A method as described in any of the above g) to m) wherein the target polypeptide is a natriuretic peptide. 
[0026] Proteins and peptides to which the method according to the present invention is applicable are as follows:
Adrenocorticotropic Hormone, Adrenomedullin, Amylin, Angiotensin I, Angiotensin II, Angiotensin III, A-type
Natriuretic Peptide, B-type Natriuretic Peptide, Bradykinin, Calcitonin, Calcitonin Gene Related Peptide,
Cholecystokinin, Corticotropin Releasing Factor, Cortistatin, C-type Natriuretic Peptide, α-Defensin 1,
β-Defensin 1, β-Defensin 2, Delta Sleep-Inducing Peptide, Dynorphin A, Elafin, α-Endorphin, β-Endorphin,
γ-Endorphin, Endothelin-1, Endothelin-2, Endothelin-3, Big Endothelin-1, Big Endothelin-2, Big Endothelin-3,
Enkephalin, Galanin, Big Gastrin, Gastrin, Gastric Inhibitory Polypeptide, Gastrin Releasing Peptide, Ghrelin,
Glucagon, Glucagon-like Peptide 1, Glucagon-like Peptide 2, Growth Hormone Releasing Factor, Growth Hormone,
Guanylin, Uroguanylin, Histatin 5, Insulin, Joining Peptide, Luteinizing Hormone Releasing Hormone, Melanocyte
Stimulating Hormone, Midkine, Motilin, Neurokinin A, Neurokinin B, Neuromedin B, Neuromedin C, Neuropeptide Y,
Neurotensin, Oxytocin, Proadrenomedullin N-terminal 20 Peptide, Parathyroid Hormone, Parathyroid
Hormone-Related Protein, Pituitary Adenylate Cyclase Activating Polypeptide 38, Platelet Factor-4, Peptide T,
Secretin, Serum Thymic Factor, Somatostatin, Substance P, Thyrotropin Releasing Hormone, Urocortin, Vasoactive
Intestinal Peptide, Vasopressin and the like and derivatives thereof (in the case of ANP among the above
peptides, for example, use can be made of not only natural ANP consisting of 28 amino acids (i.e., ANP(1-28))
but also derivatives with deletion of amino acids in the amino acid sequence such as ANP(3-28) and ANP(4-28)).
[0027] The present invention will be described in greater detail. 

[0028] pG117S4HompRHPR is an expression plasmid which expresses a fusion protein (PR) containing a
glucagon-like peptide-1 (GLP-1[G]). This fusion protein consists of a protective protein originating in 117
amino acids from the N-terminus of E. coli β-galactosidase, a linker sequence consisting of 35 amino acids
including an arginine-arginine sequence, and human glucagon-like peptide-1 (GLP-1[G]). The present inventors
have already found that E. coli OmpT protease cleaves the central peptide bond in the arginine-arginine sequence
in the linker sequence so as to release the target peptide consisting of 44 amino acids containing GLP-1[G]. The
present inventors converted the arginine-arginine sequence of the fusion protein encoded by pG117S4HompRHPR into
arginine-X (wherein X represents the amino acid at the +1-position of the cleavage site) by site-specific
mutagenesis based on PCR and examined whether or not the thus substituted fusion protein PRX (wherein X
represents the one-letter code of the amino acid (selected from 20 amino acids in total); for example, a fusion
protein having a substitution into alanine is represented as PRA) was cleaved by OmpT protease at this site. To
express each fusion protein, use was made of an OmpT protease-deficient E. coli strain W3110 M25. Since such a
fusion protein was accumulated as an inclusion body in cells, the cells were disrupted and the inclusion body
was collected by centrifugation. Then the inclusion body was solubilized with urea and employed in the OmpT
protease reaction. The reaction was carried out by adding 20 mU of OmpT protease to a reaction solution
containing 4 M urea, 50 mM sodium phosphate (pH 7.0), 2 mM EDTA and each fusion protein inclusion body. The
cleavage of the fusion protein was analyzed by SDS-PAGE (16%) and the N-terminal amino acid sequence of the
target peptide thus excised was determined by using a protein sequencer.
[0029] As a result, it was clarified for the first time by the present inventors that OmpT protease has the
activity of cleaving the central peptide bond in the arginine-X sequence wherein X is an amino acid other than
aspartic acid, glutamic acid or proline. That is to say, the present inventors clarified that OmpT protease has
the activity of cleaving not only the amino acid sequences reported so far (namely, arginine-arginine,
arginine-lysine, lysine-arginine, lysine-lysine, arginine-alanine, arginine-methionine and arginine-valine) but
also the amino acid sequences represented by arginine-X (wherein X is an amino acid other than aspartic acid,
glutamic acid and proline).
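
The outcome of the PRX substitution series can be tabulated directly from the result stated above. The short
Python sketch below is only a restatement of that qualitative result (cleaved or not cleaved) for the 20
variants; the function name is hypothetical, and the table says nothing about relative cleavage efficiencies,
which are not given in this passage.

```python
# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# +1 residues reported in the text as not cleaved: Asp, Glu, Pro.
NOT_CLEAVED_PLUS1 = set("DEP")


def prx_outcomes() -> dict:
    """Map each variant name (PRA, PRC, ...) to True if the text reports cleavage."""
    return {f"PR{x}": x not in NOT_CLEAVED_PLUS1 for x in AMINO_ACIDS}


if __name__ == "__main__":
    for name, cleaved in prx_outcomes().items():
        print(name, "cleaved" if cleaved else "not cleaved")
```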

[0030] By using these fusion proteins, it was further examined whether or not the lysine-X (wherein X is located
at the +1-position of the cleavage site and represents alanine, serine, lysine, arginine, aspartic acid or
glutamic acid) was cleaved and similar results were obtained thereby. It is anticipated that OmpT protease is
largely affected by the amino acid sequence in the vicinity of the cleavage site. Therefore, the examination was
further carried out to study whether or not fusion proteins PRhANP and PRhCT (wherein the target peptide region
of the fusion protein PR was substituted respectively with α-hANP (α-type human atrial natriuretic peptide) and
hCT[G] (human calcitonin precursor)) could be cleaved by OmpT protease. As a result, it was found that PRhANP
was cleaved by OmpT protease between arginine and serine and thus α-hANP was excised therefrom. On the other
hand, PRhCT was not cleaved by OmpT protease. The fact that the arginine-cysteine sequence of PRhCT was not
cleaved by OmpT protease indicates that the recognition and cleavage of a substrate by OmpT protease are
affected by the amino acid sequence in the vicinity of the cleavage site, which supports the significance of
using known amino acid sequences cleaved by OmpT protease as proposed by the present inventors.

[0031] The above-described results were obtained by a series of studies with the use of the fusion protein PR
having amino acid sequences of known cleavage sites. Moreover, it was examined whether or not similar results
would be obtained by substituting the amino acid at the +1-position of another fusion protein RShANP (i.e., an
α-hANP fusion protein having an amino acid sequence different from PR around the cleavage site). The fusion
protein RShANP employed in this examination consists of β-gal97S, which originates in 97 amino acids from the
N-terminus of E. coli β-galactosidase, as a protective protein, and α-hANP bonded thereto via a linker
consisting of three amino acids (glutamine-phenylalanine-arginine). Attempts were made to cleave by OmpT
protease fusion proteins wherein the amino acid at the +1-position of this fusion protein had been substituted
by arginine, alanine and cysteine. As a result, it was found that these fusion proteins having a substituted
amino acid at the +1-position were also cleaved by OmpT protease to give the N-terminal derivatives of α-hANP.

[0032] These results indicate that when a region having a known OmpT protease-cleavage sequence (wherein the
-1- and +1-positions at the cleavage site are represented by arginine-X or lysine-X) is employed and said X is
substituted by an amino acid other than aspartic acid, glutamic acid or proline, the fusion protein thus
substituted is still cleaved by OmpT protease. Therefore, when a fusion protein consisting of a protective
peptide, the amino acid sequence of an OmpT protease-cleavage site and a target peptide in this order is to be
cleaved by this enzyme, it is possible to select the amino acid added to the N-terminus of the target peptide
from among amino acids other than aspartic acid, glutamic acid and proline. By using this method, a derivative
having a different amino acid at the N-terminus of a target peptide can be constructed. It is also possible to
substitute the N-terminal amino acid so as to increase the separation/purification efficiency. It is also
possible to convert a sequence which is not cleaved to a cleavable sequence.

[0033] By using the result that a peptide having aspartic acid, glutamic acid or proline as X is not cleavable
by OmpT protease, moreover, it is possible to convert a fusion protein or protein into one which cannot be
digested by OmpT protease. More specifically, it is sometimes observed that, in the process of producing a
protein expressed in E. coli, the target protein can hardly be isolated due to digestion by OmpT protease. In
such a case, the protein can be converted into a fusion protein or protein that is not digested by OmpT protease
by substituting the amino acid at the +1-position of the cleavage site with aspartic acid, glutamic acid or
proline, which facilitates the production.
[0034] The present inventors further substituted amino acids at the -10- to -1-positions of the OmpT protease
cleavage site of the fusion protein PRR and examined the cleavage of these fusion proteins by OmpT protease. As
a result, they have found that the amino acid sequence on the N-terminal side of the cleavage site affects the
cleavage efficiency. In particular, the cleavage efficiency was elevated by setting a basic amino acid such as
arginine or lysine as the amino acid at the -4-position but lowered by setting an acidic amino acid such as
aspartic acid or glutamic acid. Similar results were obtained concerning the amino acid at the -6-position.
Based on these results, it is considered that OmpT protease recognizes the electric charges of the amino acids
at these positions. When arginine at the -1-position was substituted by alanine, no cleavage occurred. This fact
indicates again that the amino acid at this position plays an important role in the cleavage.

[0035] In addition, RShANP, which is an α-hANP fusion protein having a different amino acid sequence around the
cleavage site from PRR, was used, and an increase in the cleavage efficiency was observed in a fusion protein
RShANPR wherein tyrosine and alanine at the -6- and -4-positions respectively in the cleavage site had both been
substituted by arginine.

[0036] Accordingly, the cleavage efficiency can be improved by converting either or both of the amino acids at
the -6- and -4-positions into a basic amino acid, while the cleavage efficiency can be lowered by converting
these amino acids into an acidic one. That is to say, the cleavage efficiency can be controlled to a certain
extent without substituting the amino acids at the cleavage site (i.e., the -1- or +1-position).
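
The charge trend at the -6- and -4-positions can be summarized for a given site as follows. This Python sketch
is a qualitative illustration under stated assumptions: the function names, the labels "raises", "lowers" and
"unstated", and the `cut` convention (number of residues on the N-terminal side of the scissile bond) are
hypothetical. Only lysine and arginine are treated as basic and only aspartic acid and glutamic acid as acidic,
following the residues named in the text; every other residue, including histidine, is left as "unstated"
because the passage gives no trend for it.

```python
BASIC = {"K", "R"}    # lysine, arginine: reported to raise cleavage efficiency
ACIDIC = {"D", "E"}   # aspartic acid, glutamic acid: reported to lower it


def charge_effect(residue: str) -> str:
    """Return the qualitative trend the text reports for a residue at -6 or -4."""
    r = residue.upper()
    if r in BASIC:
        return "raises"
    if r in ACIDIC:
        return "lowers"
    return "unstated"


def upstream_charge_summary(seq: str, cut: int) -> dict:
    """Map -6 and -4 to (residue, trend); positions before the N-terminus are skipped."""
    if not 0 < cut <= len(seq):
        raise ValueError("cut must lie inside the sequence")
    summary = {}
    for position in (-6, -4):
        index = cut + position  # position -1 corresponds to seq[cut - 1]
        if index >= 0:
            summary[position] = (seq[index], charge_effect(seq[index]))
    return summary
```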

[0037] Moreover, experimental operations not described in the Examples will first be described in detail.

(1) Materials and methods

[0038] Unless otherwise stated in the Examples, the experimental operations were carried out as follows.
[0039] Synthesis of DNA primers was entrusted to Pharmacia. Nucleotide sequences were determined by using an
A.L.F. DNA Sequencer (manufactured by Pharmacia) with the use of a Thermo Sequenase fluorescent labeled primer
cycle sequencing kit with 7-deaza-dGTP (manufactured by Amersham). Plasmid DNAs were isolated from E. coli by
using a PI-100Σ (manufactured by Kurabo). To cleave DNA with restriction enzymes, the reaction was carried out
at 500 to 2000 U/ml for 2 hours. The structure of a plasmid was analyzed in 10 µl of a liquid reaction mixture
with the use of 0.5 to 1 µg of DNA. A DNA fragment was prepared in 30 µl of a liquid reaction mixture with the
use of 5 to 10 µg of DNA. The reaction conditions (temperature, buffer, etc.) were determined according to the
manufacturer's instructions. Samples for agarose gel electrophoresis were prepared by adding a 1/10 volume of
sample buffer to the liquid reaction mixture. As the buffer for agarose gel electrophoresis, use was made of TAE
buffer (40 mM Tris-acetic acid, 1 mM EDTA). The electrophoresis was effected at 100 V for 30 minutes to 1 hour.
After staining with an aqueous ethidium bromide solution, the gel was UV-irradiated to detect DNA bands. The
concentration of the agarose gel was adjusted to 0.8 or 2.0% (w/v) depending on the size of the DNA fragment to
be fractionated. After the agarose gel electrophoresis, the target DNA band was cut out and the DNA was
extracted from the gel by using SUPREC-01 (manufactured by Takara Shuzo). This DNA solution was treated with
phenol/chloroform, then precipitated with ethanol and dissolved in TE buffer (10 mM Tris-HCl (pH 8.0), 1 mM
EDTA). The ligation reaction was carried out by using Ligation high (manufactured by Toyobo) at a predetermined
reaction mixture composition at 16°C for 30 minutes or overnight. PCR was carried out by using KOD Dash or KOD
DNA polymerase (manufactured by Toyobo). The PCR conditions (temperature, buffer, etc.) and the composition of
the liquid reaction mixture were each determined according to the manufacturer's instructions.
[0040] Transformation of E. coli with a plasmid was carried out by the calcium chloride method by using the JM109 strain purchased from Takara Shuzo as competent cells. The transformant was selected with the use of tetracycline (10 µg/ml). Unless otherwise stated in the Examples, JM109 was employed for the transformation of E. coli.

(2) Measurement of enzymatic activity of OmpT protease 

[0041] Activity of OmpT protease was measured by using Dynorphin A (manufactured by Peptide Institute Inc.) as a substrate.

[0042] A 5 µg aliquot of 1 mg/ml Dynorphin A was added to 40 µl of 50 mM sodium phosphate (pH 6.0) containing 0.1% Triton X-100. Then 5 µl of a sample for measuring OmpT protease activity was added thereto and the reaction was initiated. The reaction was carried out at 25°C for 10 minutes and then stopped by adding 5 µl of 1 N HCl. The liquid reaction mixture was centrifuged (10000 x g, 2 minutes). The supernatant was collected and a 20 µl portion thereof was analyzed by HPLC. The HPLC analysis was carried out by using a YMC PROTEIN RP column at a column temperature of 40°C and a flow rate of 1 ml/min. After washing with 10% acetonitrile containing 0.1% trifluoroacetic acid for 3 minutes, the mixture was subjected to linear gradient elution with 10-15% acetonitrile containing 0.1% trifluoroacetic acid for 10 minutes. The absorption at 220 nm was monitored and thus the digestion product peptide YGGFLR was detected. The unit of OmpT protease activity was defined as the cleavage of 1 µmol of Dynorphin A per minute at 25°C under these conditions.
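
The unit definition in paragraph [0042] can be turned into a small calculation. The following Python sketch is an illustration only: the function name and the example amount of cleaved substrate are hypothetical and do not appear in the source, and it assumes that the amount of cleaved substrate (in µmol) has already been obtained from the YGGFLR peak on HPLC.

```python
def ompt_activity_u_per_ml(cleaved_umol, minutes=10.0, sample_ul=5.0):
    """Activity of the assayed sample in U/ml.

    One unit = cleavage of 1 µmol of substrate per minute at 25°C.
    Defaults mirror the assay: 10-minute reaction, 5 µl of enzyme sample.
    """
    if minutes <= 0 or sample_ul <= 0:
        raise ValueError("minutes and sample_ul must be positive")
    units_in_assay = cleaved_umol / minutes      # µmol per minute = U
    return units_in_assay / (sample_ul / 1000)   # U per ml of enzyme sample


# Hypothetical reading: 0.001 µmol of substrate cleaved in the 10-minute assay.
example_activity = ompt_activity_u_per_ml(0.001)
print(f"{example_activity:.3f} U/ml")
```

Samples with high activity would presumably have to be diluted before the assay so that the substrate is not exhausted; the dilution factor is not modelled here.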

(3) SDS-polyacrylamide gel electrophoresis

[0043] SDS-polyacrylamide gel electrophoresis was carried out by using 16% Wide-PAGEmini (manufactured by Tefco) as a gel, Tricine electrophoretic buffer (manufactured by Tefco) as an electrophoretic buffer, and molecular weight marker proteins (manufactured by Tefco) as molecular weight markers. An equal volume of 2 x SDS-PAGE sample buffer containing 4 M urea (the urea being omitted when analyzing OmpT protease protein) was added to a sample and the mixture was heated at 100°C for 2 minutes. Then a 10 µl portion thereof was electrophoresed under the conditions according to Tefco's instructions. After the completion of the electrophoresis, the gel was stained with a staining solution containing Coomassie Brilliant Blue R-250.

(4) Preparation of inclusion body 

[0044] Fusion proteins PRX, PKX, PRhANP, PRhCT, RShANP, RXhANP, PRRXA, PRR-4X, PRR-6X and RShANPR were each prepared as an inclusion body in the following manner.

[0045] E. coli expressing each of PRX, PKX, PRhANP, PRhCT, RShANP, RXhANP, PRRXA, PRR-4X, PRR-6X and RShANPR was cultured under rotation at 150 rpm, 37°C overnight in 2 l Erlenmeyer flasks containing 400 ml of an LB liquid medium (0.5% (w/v) yeast extract, 1% (w/v) tryptone, 0.5% sodium chloride) containing 10 mg/l tetracycline. On the next day, the cells were collected by centrifugation (4°C, 6000 x g, 10 minutes) and disrupted by ultrasonication. Deionized water was added to this disrupted cell solution to give a total volume of 30 ml. Then the mixture was centrifuged (4°C, 25000 x g, 15 minutes) and the supernatant was discarded. The precipitate fraction (inclusion body) was recovered and further suspended in 30 ml of 50 mM Tris-HCl (pH 8.0) containing 5 mM EDTA and 1% Triton X-100. The suspension was centrifuged (4°C, 25000 x g, 15 minutes). The precipitate thus obtained was suspended in deionized water and centrifuged (4°C, 25000 x g, 15 minutes). Then the precipitate was recovered and deionized water was added thereto to give a total volume of 1.5 ml. After suspending the precipitate, the suspension was centrifuged (4°C, 10000 x g, 30 minutes) to give a precipitate. Then the above procedure was repeated so as to give a suspension of the precipitate in deionized water of OD660 = 100 or OD660 = 200. The inclusion bodies thus prepared were employed as substrates in the OmpT protease reaction.

(5) OmpT protease reaction

[0046] By using PRX, PKX, PRhANP, PRhCT, PRRXA, PRR-4X and PRR-6X as the substrates, the OmpT protease reaction was carried out in the following manner. To 20 µl of 10 M urea were added 2.5 µl of 1 M sodium phosphate (pH 7.0) and 2 µl of 50 mM EDTA. Then 10 µl of a fusion protein inclusion body (OD660 = 100) was added thereto and the inclusion body was dissolved. After adding 10.5 µl of water, 5 µl of 4 U/ml (20 U/ml in the case of PRR-4X, 1 U/ml in the case of PRR-6X) OmpT protease was added thereto and the reaction was initiated at a liquid reaction mixture volume of 50 µl. The reaction was carried out at 25°C for 30 or 60 minutes.
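
As a cross-check of the pipetting scheme in paragraph [0046], the component volumes can be summed and the final concentrations derived. The Python sketch below is only an illustration: the data structure and function name are hypothetical, and the final concentrations it prints are derived values that are not stated in the source.

```python
# (name, volume in µl, stock concentration, unit); None = concentration not tracked
COMPONENTS = [
    ("urea", 20.0, 10.0, "M"),
    ("sodium phosphate (pH 7.0)", 2.5, 1.0, "M"),
    ("EDTA", 2.0, 50.0, "mM"),
    ("inclusion body suspension", 10.0, None, None),
    ("water", 10.5, None, None),
    ("OmpT protease", 5.0, 4.0, "U/ml"),
]


def final_concentrations(components):
    """Return the total volume (µl) and the final concentration of each stock."""
    total_ul = sum(volume for _, volume, _, _ in components)
    finals = {
        name: (stock * volume / total_ul, unit)
        for name, volume, stock, unit in components
        if stock is not None
    }
    return total_ul, finals


total_ul, finals = final_concentrations(COMPONENTS)
print(f"total volume: {total_ul} µl")
for name, (value, unit) in finals.items():
    print(f"{name}: {value:g} {unit}")
```

Under these assumptions the volumes add up to the stated 50 µl, and the reaction runs in roughly 4 M urea, 50 mM sodium phosphate and 2 mM EDTA.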

[0047] The peptides obtained by the OmpT protease reaction with the use of PRX, PKX, PRRXA, PRR-4X and PRR-6X as the substrates were each isolated and quantitated by HPLC under the conditions specified below. To the OmpT protease reaction mixture, an equal amount of 12% acetic acid and 4 M urea was added to thereby stop the reaction. Then the liquid reaction mixture was centrifuged (10000 x g, 2 minutes) and a 20 µl portion or a 50 µl portion of the supernatant was applied to a YMC PROTEIN RP column. HPLC was carried out at a column temperature of 40°C and a flow rate of 1 ml/min. After performing linear gradient elution with 30-50% acetonitrile containing 0.1% trifluoroacetic acid for 16 minutes, the absorption at 214 nm was monitored and thus the peptide was isolated and quantitated.
[0048] By using RShANP and RXhANP as the substrates, the OmpT protease reaction was carried out in the following manner. To 20 µl of 10 M urea were added 2.5 µl of 1 M sodium phosphate (pH 7.0) and 2 µl of 50 mM EDTA. Then 5 µl of a fusion protein inclusion body (OD660 = 200) was added thereto and the inclusion body was dissolved. After adding 15.5 µl of water, 5 µl of 10 U/ml OmpT protease was added thereto and the reaction was initiated at a liquid reaction mixture volume of 50 µl. The reaction was carried out at 37°C for 120 minutes.

[0049] By using RShANP and RShANPR as the substrates, the OmpT protease reaction was carried out in the following manner. To 8 µl of 10 M urea were added 1.0 µl of 1 M sodium phosphate (pH 7.0) and 0.8 µl of 50 mM EDTA. Then 4 µl of the fusion protein inclusion body (OD660 = 100) was added thereto and the inclusion body was dissolved. After adding 4.2 µl of water, 2 µl of 20 U/ml OmpT protease was added thereto and the reaction was initiated at a liquid reaction mixture volume of 20 µl. The reaction was carried out at 25°C for 90 minutes.

[0050] The peptides obtained by the OmpT protease reaction with the use of PRhANP, RShANP, RXhANP and RShANPR as the substrates were each isolated and quantitated by HPLC under the conditions specified below. To the OmpT protease reaction mixture, an equal amount of 12% acetic acid and 4 M urea was added to thereby stop the reaction. Then the liquid reaction mixture was centrifuged (10000 x g, 2 minutes) and a 20 µl portion or a 50 µl portion of the supernatant was applied to a YMC A-302 ODS column. HPLC was carried out at a column temperature of 40°C and a flow rate of 1 ml/min. After performing linear gradient elution with 21.5-32% acetonitrile containing 0.1% trifluoroacetic acid for 15 minutes, the absorption at 214 nm was monitored and thus the peptide was isolated and quantitated.

(6) Analysis of N-terminal amino acid sequence of peptide

[0051] The N-terminal amino acid sequence of each peptide thus obtained was determined with respect to 5 amino acid residues by using Protein Sequencer 477A-120A or PROCISE 492 (manufactured by ABI).

EXAMPLES

[0052] The present invention will be described in greater detail by reference to the following Examples.

Example 1 : Preparation of fusion protein PRX

[0053] OmpT protease is an endoprotease which exists in the E. coli outer membrane. Although this enzyme has a high substrate specificity, the characteristics of the amino acid sequences in the substrate recognized by the enzyme have not been sufficiently clarified so far. It is known that OmpT protease cleaves the center bond of basic amino acid pairs (arginine-arginine, arginine-lysine, lysine-arginine and lysine-lysine). In addition, it is reported that OmpT protease cleaves the C-terminal peptide bond of a basic amino acid (arginine-methionine, arginine-alanine and arginine-valine). However, OmpT protease does not always cleave these sites in the amino acid sequences of proteins and peptides, and the cleavage by this enzyme is greatly affected by the amino acid sequence in the vicinity of the cleavage site. It is therefore estimated that the enzyme has a high substrate specificity and cleaves exclusively specific sites. The present inventors expected that a novel substrate specificity of this enzyme would be found by examining the amino acid sequence at the +1-position of the cleavage site with the use of known cleavage sites of this enzyme. From this viewpoint, they conducted the following experiments.

[0054] At the +1-position of a fusion protein PR (i.e., a fusion protein consisting of a protective protein (β-gal117S4H) derived from the 117 amino acids in the N-terminus of E. coli β-galactosidase and human glucagon-like peptide-1 (GLP-1[G])) having the structure as shown in Fig. 4, which is cleavable by OmpT protease, an amino acid substitution was made to thereby form fusion protein PRX (Fig. 6; wherein X represents the one letter code of the amino acid substituted (20 types in total); namely, a fusion protein having substitution into alanine is represented as PRA). In the fusion protein PRX, the OmpT protease cleavage site -RLYR↓RHHG- (SEQ ID NO:1) of the original fusion protein PR was converted into -RLYRXHHG- (SEQ ID NO:2). Then cleavage by OmpT protease was examined.

[0055] The fusion protein PRX was prepared by the following five steps.

(1) Step 1: Construction of pG117S4HR6GLP-1 (Fig. 1) 

[0056] First, plasmid pG117S4HR6GLP-1 was constructed. This plasmid carried a sequence arginine-arginine which was inserted as the OmpT protease recognition/cleavage site into the linker moiety of the fusion protein. In the construction, the R6 synthesis DNA sequence (see Fig. 1) was inserted into the StuI site of pG117S4HGP (see Japanese Laid-Open Patent Publication No. 9-296000 and EP 794255) to thereby give pG117S4HR6GLP-1. In Fig. 1, β-gal117S4H represents a protective protein derived from the 117 amino acids in the N-terminus of E. coli β-galactosidase and GLP-1[G] represents human glucagon-like peptide-1.

(2) Step 2: Construction of pG117S4HompRHKR (Fig. 2)

[0057] To further enhance the cleavage efficiency by OmpT protease, the sequence in the R6 moiety was modified in the following manner. A 3.2 kbp fragment (fragment 1) obtained by cleaving pG117S4HR6GLP-1 by NsiI and HindIII, a 0.2 kbp fragment (fragment 2) obtained by cleaving pG117S4HR6GLP-1 by BamHI and HindIII, and an L1 synthesis DNA (see Fig. 2) encoding an amino acid sequence L1 (see Fig. 2) having an arginine-arginine sequence (i.e., the recognition/cleavage site of OmpT protease) were ligated together to thereby construct pG117S4HompRHKR.

(3) Step 3: Construction of pG117S4HompRHPR (Fig. 3)

[0058] Since the lysine-arginine (KR) sequence (corresponding to the 152- and 153-positions in Fig. 4) positioned immediately before the N-terminus of GLP-1[G] in the fusion protein expressed by pG117S4HompRHKR is cleavable by OmpT protease, it had been found in preliminary experiments that this fusion protein is cleaved at two sites when treated with OmpT protease. To facilitate the analysis, therefore, this sequence was substituted by proline-arginine (PR) to thereby prevent it from being cleaved by OmpT protease. Primers P1: 5'-GACTCAGATCTTCCTGAGGCCGAT-3' (SEQ ID NO:3) and P2: 5'-AAAGGTACCTTCCGCATGCCGCGGATGTCGAGAAGG-3' (SEQ ID NO:4) were synthesized and PCR was performed with the use of pG117S4HompRHKR as a template to give a 0.1 kbp DNA fragment. The obtained fragment was treated with BglII and SphI (fragment 3) and then ligated to a 3.2 kbp fragment (fragment 4) obtained by cleaving pG117S4HompRHKR by BglII and HindIII and a 0.2 kbp fragment (fragment 5) obtained from pG117S4HompRHKR by SphI and HindIII to thereby construct pG117S4HompRHPR. Fig. 4 shows the whole amino acid sequence of the fusion protein PR encoded by pG117S4HompRHPR.

(4) Step 4: Construction of pG117ompPRX (Fig. 5)

[0059] The OmpT protease cleavage site -RLYR↓RHHG- of the fusion protein PR encoded by pG117S4HompRHPR was converted into -RLYRXHHG- (wherein X represents an amino acid selected from the 20 types). This conversion was carried out by introducing a mutation into pG117S4HompRHPR.
[0060] The mutation was introduced by PCR by using pG117S4HompRHPR as a template. As primers, use was made of P3: 5'-ACCCCAGGCTTTACACTTTA-3' and P4X: 5'-CCGGATCCGTGATGNNNGCGATACAGGCG-3' (wherein X represents the one letter code of the amino acid (20 types in total); and NNN represents AGC, AAC, GAG, GAT, CGG, GAA, CCA, CAT, GCC, AGA, GGT, GCA, GTA, GTT, CTG, GTC, TTC, TTT, ACG or ATG when conversion into alanine, valine, leucine, isoleucine, proline, phenylalanine, tryptophan, methionine, glycine, serine, threonine, cysteine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, lysine, arginine or histidine is intended respectively). The PCR product thus obtained was digested by PvuI and BamHI to give a 0.3 kbp fragment (fragment 6). Furthermore, PCR was carried out by using pG117S4HompRHPR as a template and P5: 5'-ACGGATCCGGTTCCCCTTATCGACATCCG-3' and P6: 5'-TTGCGCATTCACAGTTCTCC-3' as primers. The PCR product thus obtained was digested by BamHI and HindIII to give a 0.2 kbp fragment (fragment 7). These fragments 6 and 7 and a 3.0 kbp fragment (fragment 8) obtained by digesting pG117S4HompRHPR by PvuI and HindIII were ligated and transformation was carried out. Plasmids were isolated from each clone thus obtained and restriction enzyme analysis and nucleotide sequencing at the mutated site were performed so that each was identified as the expression plasmid of the target fusion protein PRX. These plasmids were collectively referred to as pG117ompPRX (wherein X represents the one letter code of the amino acid (20 types in total); namely, a fusion protein having the substitution into alanine is expressed by pG117ompPRA) (Fig. 5).
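
Because P4X is an antisense primer, each NNN triplet is the reverse complement of the codon it introduces. The following Python sketch checks this for all 20 triplets by reverse-complementing the primer and translating it with the standard genetic code. It is an illustration under stated assumptions: the helper names are hypothetical, the alanine triplet is taken as AGC, and the reading frame is assumed to start at the first base of the reverse-complemented primer.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

P4X_TEMPLATE = "CCGGATCCGTGATG{nnn}GCGATACAGGCG"  # antisense primer, 5' to 3'

# NNN triplets listed for P4X, keyed by the intended residue (one letter code).
NNN_BY_RESIDUE = {
    "A": "AGC", "V": "AAC", "L": "GAG", "I": "GAT", "P": "CGG",
    "F": "GAA", "W": "CCA", "M": "CAT", "G": "GCC", "S": "AGA",
    "T": "GGT", "C": "GCA", "Y": "GTA", "N": "GTT", "Q": "CTG",
    "D": "GTC", "E": "TTC", "K": "TTT", "R": "ACG", "H": "ATG",
}


def reverse_complement(seq):
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons from position 0; trailing bases are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def encoded_peptide(nnn):
    """Peptide encoded by the sense strand corresponding to P4X."""
    return translate(reverse_complement(P4X_TEMPLATE.format(nnn=nnn)))


for residue, nnn in NNN_BY_RESIDUE.items():
    peptide = encoded_peptide(nnn)
    # Expected context: R-L-Y-R-X-H-H-G-S, with X at the +1-position.
    assert peptide == f"RLYR{residue}HHGS", (residue, nnn, peptide)
print("all 20 triplets encode the intended residue")
```

The same check could be applied to any other antisense mutagenic primer by replacing the template string and the expected flanking residues.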

(5) Step 5: Preparation of fusion protein PRX

[0061] When pG117ompPRX is expressed in E. coli, the fusion protein PRX (Fig. 6) is expressed as an inclusion body. In a case where OmpT protease is expressed in E. coli, the inclusion body is cleaved by OmpT protease merely by dissolving with urea. To avoid the cleavage, therefore, pG117ompPRX was transfected into W3110 M25 (i.e., an OmpT protease-deficient E. coli strain) and thus the fusion protein PRX was prepared in the form of an inclusion body.






Example 2 : Preparation of purified OmpT protease specimen

[0062] To prepare purified OmpT protease, the OmpT protease expression plasmid was transfected into the E. coli W3110 strain to construct an OmpT protease high-expression E. coli strain. From the membrane fraction of this strain, OmpT protease was purified by the following five steps.

(1) Step 1: Construction of pOmpTTc (Fig. 7)

[0063] To enhance the expression level of OmpT protease, an OmpT protease expression plasmid pOmpTTc was constructed. EcoRI and SalI restriction enzyme sites were introduced by site-specific mutagenesis immediately before the translation initiation site and at the 3'-end, respectively, of the OmpT gene in plasmid pGP501 (Sugimura, K. Biochem. Biophys. Res. Commun. 153: 753-759, 1988) containing an OmpT protease gene. By digesting the plasmid by these restriction enzymes, a 1.3 kbp fragment (fragment 9) was obtained.

[0064] To introduce an EcoRI restriction enzyme site downstream of the lac promoter, PCR was carried out by using pG117S4HompRHPR as a template and P7: 5'-GCGGGTGTTGGCGGGTGTCG-3' and P8: 5'-TGAATTCTTCCTGTGTGAAATTGTTAT-3' as primers. The PCR product thus obtained was digested by EcoRI and AlwNI to give a 0.5 kbp fragment (fragment 10). These fragments 9 and 10 and a 2.3 kbp fragment (fragment 11) obtained from pG117S4HompRHPR by AlwNI and SalI were ligated to thereby construct pOmpTTc.

(2) Step 2: Construction of pOmpTTcB (Fig. 8)

[0065] In accordance with a method for constructing plasmids showing high-level expression of proteins in E. coli (Shunji Natori, Yoshinobu Nakanishi, Zoku Iyakuhinn no Kaihatsu (Sequel To Development of Drugs), vol. 7, 29-61, 1991, Hirokawa Shoten), attempts were made to improve pOmpTTc in the following two points: ① to place the OmpT protease translation initiation site 9 bases downstream of the SD sequence; and ② to modify nucleotides so as to minimize the formation of stems or loops in the secondary structure of mRNA between the transcription initiation site and the fifth amino acid from the initiation of OmpT protease translation. After constructing pOmpTTcB (Fig. 8) by the improvement ① and pOmpTTcC (Fig. 9) by the improvement ②, pOmpTTcE (Fig. 10) having been subjected to both of the improvements ① and ② was constructed.

[0066] pOmpTTcB with the improvement ① was constructed in the following manner (Fig. 8).

[0067] pOmpTTc was digested by HincII and MfeI to give a 1.0 kbp fragment (fragment 12) and pOmpTTc was digested by EcoRI and MfeI to give a 2.9 kbp fragment (fragment 13).

[0068] To perform the improvement ①, PCR was carried out by using pOmpTTc as a template and P9: 5'-TGAATTCAAAATGCGGGCGAAAGTGCTGGG-3' and P10: 5'-TGGCGAGGATGACGATGAGC-3' as primers. The PCR product thus obtained was digested by EcoRI and HincII and the resulting 0.2 kbp fragment (fragment 14) was ligated to the fragments 12 and 13 to thereby construct pOmpTTcB.

(3) Step 3: Construction of pOmpTTcC (Fig. 9) 

[0069] pOmpTTcC with the improvement ② was constructed in the following manner.

[0070] pOmpTTc was digested by EcoRI and SalI to give a 1.3 kbp fragment (fragment 15), and by AlwNI and SalI to give another 2.3 kbp fragment (fragment 16).

[0071] To perform the improvement ②, PCR was carried out by using pOmpTTc as a template and P11: 5'-GTATGGTGGGGGGAGTrATG-3' and P12: 5'-TGAATTCTTCCTGTGTGTAATTTTTATCCGCTCACAATT-3' as primers. The PCR product thus obtained was digested by EcoRI and AlwNI. The resulting 0.5 kbp fragment (fragment 17) was ligated to the fragments 15 and 16 to thereby construct pOmpTTcC.

(4) Step 4: Construction of pOmpTTcE (Fig. 10)

[0072] To improve the expression level of OmpT protease, pOmpTTcE with the improvements ① and ② was constructed in the following manner.

[0073] The 1.3 kbp fragment (fragment 18) obtained by digesting pOmpTTcB by EcoRI and SalI was ligated to a 2.8 kbp fragment (fragment 19) obtained by digesting pOmpTTcC by EcoRI and SalI to thereby construct pOmpTTcE.
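
A quick bookkeeping check on the four ligations described for the OmpT expression plasmids is to add up the quoted fragment sizes. The Python sketch below is illustrative only: the dictionary layout and function name are hypothetical, the sizes are the nominal values quoted in the text, and the resulting plasmid sizes are derived figures that the source does not state.

```python
# Nominal fragment sizes in units of 0.1 kbp, as quoted for each ligation.
LIGATIONS = {
    "pOmpTTc":  {"fragment 9": 13, "fragment 10": 5, "fragment 11": 23},
    "pOmpTTcB": {"fragment 12": 10, "fragment 13": 29, "fragment 14": 2},
    "pOmpTTcC": {"fragment 15": 13, "fragment 16": 23, "fragment 17": 5},
    "pOmpTTcE": {"fragment 18": 13, "fragment 19": 28},
}


def plasmid_size_kbp(fragments):
    """Sum integer sizes (0.1 kbp units) and convert to kbp."""
    return sum(fragments.values()) / 10


sizes = {name: plasmid_size_kbp(frags) for name, frags in LIGATIONS.items()}
for name, size in sizes.items():
    print(f"{name}: {size} kbp")
```

Since each step only exchanges a short region, all four constructs are expected to come out at the same nominal size, which is what the quoted fragment sizes give.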

(5) Step 5: Preparation of purified OmpT protease specimen

[0074] To obtain a purified OmpT protease specimen, pOmpTTcE was transfected into the E. coli W3110 strain to thereby give an OmpT protease high-expression E. coli strain. Next, the OmpT protease high-expression E. coli strain was cultured in the following manner and OmpT protease was purified.

[0075] The W3110/pOmpTTcE strain was incubated under rotation at 37°C overnight in a 500 ml Erlenmeyer flask with the use of 100 ml of a liquid LB medium containing 10 mg/l of tetracycline. On the next day, it was transferred into a culture vessel equipped with a stirrer containing 2 l of a medium containing 4 g/l K2HPO4, 4 g/l KH2PO4, 2.7 g/l Na2HPO4, 0.2 g/l NH4Cl, 1.2 g/l (NH4)2SO4, 4 g/l yeast extract, 2 g/l MgSO4·7H2O, 40 mg/l CaCl2·2H2O, 40 mg/l FeSO4·7H2O, 10 mg/l MnSO4·nH2O, 10 mg/l AlCl3·6H2O, 4 mg/l CoCl2·6H2O, 2 mg/l ZnSO4·7H2O, 2 mg/l Na2MoO4·2H2O, 1 mg/l CuCl2·2H2O, 0.5 mg/l H3BO4, 1 g/l glucose, 10 g/l glycerol and 10 mg/l tetracycline and cultivated therein at 37°C for 12 hours. After the completion of the cultivation, the culture medium was centrifuged (4°C, 6000 x g, 10 minutes) to give 80 g of packed cells. These cells were suspended in 600 ml of 50 mM Tris-HCl (pH 7.5) and the suspension was centrifuged (4°C, 6000 x g, 10 minutes) to thereby collect the cells. After repeating this procedure, the cells were suspended in 600 ml of 50 mM Tris-HCl (pH 7.5) and disrupted with a Manton-Gaulin homogenizer. The suspension of the disrupted cells was centrifuged (4°C, 1000 x g, 10 minutes), the precipitate was discarded and the supernatant was recovered. The supernatant was further centrifuged (4°C, 36000 x g, 40 minutes). The precipitate was recovered, suspended in 150 ml of 50 mM Tris-HCl (pH 7.5) and centrifuged again (4°C, 36000 x g, 40 minutes). To a 1/6 portion of the precipitate thus obtained was added 120 ml of 50 mM Tris-HCl (pH 7.5) containing 0.1% sarcosyl. After suspending, the mixture was shaken at 10°C for 1 hour. After 1 hour, it was centrifuged (4°C, 36000 x g, 40 minutes) and the precipitate was recovered. Further, it was suspended in 120 ml of 50 mM Tris-HCl containing 0.1% of Triton X-100 and 5 mM of EDTA and shaken at room temperature for 1 hour. Next, it was centrifuged (4°C, 36000 x g, 40 minutes) and the supernatant was recovered to give a crude enzyme specimen.

[0076] 120 ml of this crude enzyme specimen was applied onto a Benzamidine Sepharose 6B column (12 mm in diameter x 70 mm, 8 ml), which had been equilibrated with 50 mM Tris-HCl (pH 7.5) containing 0.1% Triton X-100 (hereinafter referred to as the buffer A), at a flow rate of 4 ml/min, and the column was then washed with 80 ml of the buffer A. Then the column was eluted with the buffer A containing 0.3 M NaCl and the eluate was taken up in 10 ml portions to give 8 fractions in total.

[0077] The fifth fraction, which was proved to be homogeneous by 16% SDS-PAGE, was referred to as a purified OmpT protease specimen. When determined by using a Coomassie Plus Protein Assay Reagent (manufactured by PIERCE) and bovine serum albumin as a standard, the protein concentration of the purified OmpT protease specimen was 120 µg/ml. The OmpT protease activity measured by using Dynorphin A as a substrate was 40 U/ml.
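
From the two figures reported for the purified specimen, a specific activity can be derived. The Python sketch below is only an illustration: the function name is hypothetical, the protein concentration is read as 120 µg/ml, and the resulting specific activity is a derived value that the source itself does not state.

```python
def specific_activity_u_per_mg(activity_u_per_ml, protein_ug_per_ml):
    """Specific activity (U per mg protein) from activity and protein concentration."""
    if protein_ug_per_ml <= 0:
        raise ValueError("protein concentration must be positive")
    protein_mg_per_ml = protein_ug_per_ml / 1000
    return activity_u_per_ml / protein_mg_per_ml


# Values reported for the purified OmpT protease specimen.
purified = specific_activity_u_per_mg(activity_u_per_ml=40, protein_ug_per_ml=120)
print(f"{purified:.0f} U/mg")
```

With these inputs the purified specimen works out to roughly 333 U per mg of protein.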

Example 3 : Cleavage of PRX by OmpT protease

[0078] It was examined whether or not the fusion protein PRX (Fig. 6), which was constructed by substituting the amino acid at the +1-position (141-position from the N-terminus) of the fusion protein PR having a structure cleavable by OmpT protease (Fig. 4), was cleaved by OmpT protease. PRX was reacted with the purified OmpT protease specimen at pH 7.0 for 30 minutes at 25°C. After the enzymatic reaction, SDS-PAGE analysis was performed. Fig. 11 shows the results, wherein - represents a lane free from OmpT protease, and + represents a lane of fusion protein treated with OmpT protease.

[0079] In PRD and PRE, no cleavage by OmpT protease was observed (Fig. 11, lanes D and E). In contrast, cleavage by OmpT protease was observed in the proteins other than PRD and PRE.

[0080] To identify the cleavage site, peptide digestion products obtained after the OmpT protease treatment were isolated by HPLC and the N-terminal amino acid sequences were determined. The OmpT protease cleavage sites thus identified are listed in Table 1.



Table 1

OmpT protease cleavage site of fusion protein PRX

PRX     Cleavage site
PRA     LYR↓AHHGSG (SEQ ID NO:16)
PRV     LYR↓VHHGSG (SEQ ID NO:17)
PRL     LYR↓LHHGSG (SEQ ID NO:18)
PRI     LYR↓IHHGSG (SEQ ID NO:19)
PRP     LYRPHHGSG (SEQ ID NO:20)
PRF     LYR↓FHHGSG (SEQ ID NO:21)
PRW     LYR↓WHHGSG (SEQ ID NO:22)
PRM     LYR↓MHHGSG (SEQ ID NO:23)
PRG     LYR↓GHHGSG (SEQ ID NO:24)
PRS     LYR↓SHHGSG (SEQ ID NO:25)
PRT     LYR↓THHGSG (SEQ ID NO:26)
PRC     LYR↓CHHGSG (SEQ ID NO:27)
PRY     LYR↓YHHGSG (SEQ ID NO:28)
PRN     LYR↓NHHGSG (SEQ ID NO:29)
PRQ     LYR↓QHHGSG (SEQ ID NO:30)
PRD     LYRDHHGSG (SEQ ID NO:31)
PRE     LYREHHGSG (SEQ ID NO:32)
PRK     LYR↓KHHGSG (SEQ ID NO:33)
PRR     LYR↓RHHGSG (SEQ ID NO:34)
PRH     LYR↓HHHGSG (SEQ ID NO:35)

↓ stands for an OmpT protease cleavage site. The region from leucine at the 138-position (from the N-terminus) to glycine at the 146-position in the fusion protein PRX is shown as the amino acid sequence at the cleavage site.



[0081] In PRP, cleavage was observed not at -RX- but exclusively at -ELR↓LYRPHHG-. In all of the proteins other than PRD, PRE and PRP, however, cleavage was observed at -R↓X- (Table 1). Based on these results, it is assumed that amino acid sequences having one of the 17 amino acids [namely, those other than aspartic acid and glutamic acid (acidic amino acids) and proline (an imino acid)] at the +1-position are cleavable by OmpT protease.

Example 4 : Preparation of fusion protein PKX

[0082] Similarly, to examine whether or not cleavage by OmpT protease occurs after substituting the amino acid at the +1-position by one of the amino acids other than aspartic acid and glutamic acid (acidic amino acids) and proline also in the case where the OmpT protease cleavage site has lysine as the basic amino acid at the -1-position, PKX, wherein -RLYRXHHG- of the fusion protein PRX was converted into -RLYKXHHG-, was constructed (Fig. 13). In the following example, X was selected from alanine, serine, lysine, arginine, aspartic acid and glutamic acid (represented in the one letter code of each amino acid). Among these amino acids, lysine and arginine, which formed a basic amino acid pair, were employed as positive controls. Alanine and serine were employed because they showed relatively high cleavage efficiency in Example 3. Aspartic acid and glutamic acid were employed to examine whether PKD and PKE were cleavable or not, since PRD and PRE containing these amino acids were not cleavable.

[0083] The fusion protein PKX was prepared by the following two steps.

(1) Step 1: Construction of pG117ompPKX (Fig. 12)

[0084] The plasmid pG117ompPKX (wherein X is A, S, K, R, D or E) (Fig. 12) encoding the fusion protein PKX (wherein X is A, S, K, R, D or E) (Fig. 13) was formed in the following manner. The conversion of -RLYRXHHG- into -RLYKXHHG- (wherein X is A, S, K, R, D or E) was conducted by PCR.

[0085] PCR was carried out by using pG117ompPRA, pG117ompPRS, pG117ompPRK, pG117ompPRR, pG117ompPRD and pG117ompPRE as templates and P3: 5'-ACCCCAGGCTTTACACTTTA-3' and P13X: 5'-CCGGATCCGTGATGNNNTTTATACAGGCG-3' as primers (NNN: AGC in case of using pG117ompPRA as a template; AGA in case of using pG117ompPRS; TTT in case of using pG117ompPRK; ACG in case of using pG117ompPRR; GTC in case of using pG117ompPRD; and TTC in case of using pG117ompPRE). A 0.3 kbp fragment (fragment 20) obtained by digesting the PCR product obtained above by PvuI and BamHI was ligated to a 0.1 kbp fragment (fragment 21) obtained by digesting pG117ompPRR by BamHI and SalI and a 3.1 kbp fragment (fragment 22) obtained by digesting pG117ompPRR by PvuI and SalI, and transformation was carried out. The plasmids were isolated from each clone thus obtained and restriction enzyme analysis and nucleotide sequencing at the mutated site were performed in order to confirm that the plasmids express the target fusion proteins. These plasmids were collectively referred to as pG117ompPKX (wherein X represents the one letter code of the amino acid substituted; for example, pG117ompPKA represents a plasmid with the substitution into alanine) (Fig. 12).

(2) Step 2: Preparation of fusion protein PKX

[0086] When pG117ompPKX is expressed in E. coli, the fusion protein PKX (Fig. 13) is expressed as an inclusion body. In a case where OmpT protease is expressed in E. coli, the inclusion body is cleaved by OmpT protease merely by solubilizing with urea. To avoid this phenomenon, pG117ompPKX was transformed into the OmpT protease-deficient E. coli strain W3110 M25 and thus the fusion protein PKX was prepared as an inclusion body.

Example 5 : Cleavage of PKX by OmpT protease

[0087] It was examined whether or not the fusion protein PKX (Fig. 13) could be cleaved by OmpT protease.

[0088] PKX was reacted with the purified OmpT protease specimen at 25°C for 30 minutes. Fig. 14 shows the results of SDS-PAGE analysis thereof, wherein - represents a lane free from OmpT protease, and + represents a lane of fusion protein treated with OmpT protease.

[0089] It was confirmed that PKK and PKR employed as positive controls were cleaved by OmpT protease (Fig. 14, lanes KK and KR). Also, PKA and PKS, which form no basic amino acid pair, were cleaved by OmpT protease (Fig. 14, lanes KA and KS). In contrast, PKD and PKE were not cleaved by OmpT protease (Fig. 14, lanes KD and KE).

[0090] To identify the cleavage sites, the peptide digestion products were isolated by HPLC after the OmpT protease treatment and the N-terminal amino acid sequences were then determined. The OmpT protease cleavage sites thus identified are listed in Table 2.

[0091] PKK, PKR, PKA and PKS, which were cleaved by OmpT protease, showed cleavage at -K↓X- (Table 2).



Table 2

OmpT protease cleavage site of fusion protein PKX

PKX     Cleavage site
PKA     LYK↓AHHGSG (SEQ ID NO:38)
PKS     LYK↓SHHGSG (SEQ ID NO:39)
PKK     LYK↓KHHGSG (SEQ ID NO:40)
PKR     LYK↓RHHGSG (SEQ ID NO:41)
PKD     LYKDHHGSG (SEQ ID NO:42)
PKE     LYKEHHGSG (SEQ ID NO:43)

↓ stands for an OmpT protease cleavage site. The region from leucine at the 138-position (from the N-terminus) to glycine at the 146-position in the fusion protein PKX is shown as the amino acid sequence at the cleavage site.



[0092] Accordingly, the results of Examples 3 and 5 indicate that OmpT protease cleavage sites exist not only in the case of basic amino acid pairs (-RR-, -RK-, -KR- and -KK-) but also in the case of pairs including one basic amino acid (-RX-, -KX-). When X is aspartic acid or glutamic acid (an acidic amino acid) or proline (an imino acid), however, no cleavage occurs at this site.
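
The dipeptide rule summarized in paragraph [0092] can be sketched as a short scan over a sequence. This is only an illustration under stated assumptions: the function name `candidate_sites` and the set names are hypothetical, and the rule is treated as a necessary condition only, since the surrounding sequence also influences cleavage.

```python
# Sketch of the dipeptide rule from paragraph [0092]; not a full model of
# OmpT specificity. All names below are illustrative assumptions.
BASIC = {"K", "R"}          # residues permitted at the -1 position
BLOCKING = {"D", "E", "P"}  # residues at the +1 position that prevented cleavage


def candidate_sites(seq):
    """Return every index i where the bond between seq[i-1] and seq[i]
    satisfies the rule (-1 is K or R, +1 is not D, E or P)."""
    return [i for i in range(1, len(seq))
            if seq[i - 1] in BASIC and seq[i] not in BLOCKING]


# Residues 138-146 of the PKX fusion proteins, as listed in Table 2.
substrates = {
    "PKA": "LYKAHHGSG",
    "PKS": "LYKSHHGSG",
    "PKK": "LYKKHHGSG",
    "PKR": "LYKRHHGSG",
    "PKD": "LYKDHHGSG",
    "PKE": "LYKEHHGSG",
}

for name, seq in substrates.items():
    print(name, candidate_sites(seq))
```

For PKK and PKR the scan also flags the bond following the second basic residue, which Table 2 does not report as cleaved. The rule therefore marks candidate bonds only and does not predict that every flagged bond is actually cleaved.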

Example 6 : Preparation of fusion proteins PRhANP and PRhCT

[0093] The results of Example 3 indicated that OmpT protease can cleave -R↓X- (wherein X represents an amino acid (17 types in total) other than aspartic acid, glutamic acid (acidic amino acids) and proline) in the amino acid sequences in the vicinity of the OmpT protease cleavage sites shown in Example 3. Similar results were obtained concerning the cleavage at -K↓X- in Example 5.

[0094] With respect to the recognition of substrate by this enzyme, however, it seems insufficient to merely examine the amino acid sequences reported hitherto; the amino acid sequences in the vicinity of the cleavage sites (i.e., on the N- and C-terminal sides) are also important. In the above Examples, the present inventors substituted the amino acid at the +1-position of the cleavage site of this enzyme. In view of the fact that the amino acid sequences in the vicinity of the cleavage sites might be important in substrate recognition and cleavage, the present inventors further examined how the cleavage by OmpT protease occurred when the target peptide moiety of the fusion proteins employed in Examples 3 and 5 was substituted with other peptides (i.e., altering the amino acid sequence on the C-terminal side from the amino acid at the +1-position).

[0095] A fusion protein PRhANP (Fig. 16), wherein α-hANP (α-type human atrial natriuretic peptide) was arranged following arginine at the 140-position from the N-terminus of the fusion protein PR (Fig. 4), and another fusion protein PRhCT (Fig. 18), wherein hCT[G] (human calcitonin precursor) was provided, were constructed and reacted with OmpT protease so as to examine whether or not α-hANP and hCT[G] could be excised.

[0096] An expression plasmid pG117ompPRhANP of the fusion protein PRhANP and an expression plasmid pG117ompPRhCT of the fusion protein PRhCT were constructed by using pG117ompPRR (Fig. 5). The fusion proteins PRhANP and PRhCT were prepared by the following three steps.

(1) Step 1: Construction of pG117ompPRhANP (Fig. 15)

[0097] The expression plasmid pG117ompPRhANP of the fusion protein PRhANP (Fig. 16), wherein α-hANP was arranged following arginine at the 140-position from the N-terminus of the fusion protein PR (Fig. 4), was constructed. PCR was carried out by using pGHα97SII ("Daichokin o shukushu toshita seirikassei peputido seisankei ni kansuru kenkyu (Study on Physiologically Active Peptide Production System with the Use of E. coli as Host)", Koji Magota, Doctoral Dissertation, Kyushu University, 1991) as a template and P14: 5'-GCGGAGCTCCGCCTGTATCGCAGCCTGCGGAGATCCAGCTG-3' and P15: 5'-CTGAGTCGACTCAGTACCGG-3' as primers. The PCR product thus obtained was isolated and digested by SacI and SalI. The 0.1 kbp fragment (fragment 23) thus obtained was ligated to a 3.4 kbp fragment (fragment 24) obtained by digesting pG117ompPRR by SacI and SalI, and transformation was performed. A plasmid was isolated from each clone thus obtained, and restriction enzyme analysis and nucleotide sequencing at the mutated site were performed so that it was identified as the expression plasmid of the target fusion protein. This plasmid was referred to as pG117ompPRhANP.

(2) Step 2: Construction of pG117ompPRhCT (Fig. 17)

[0098] An expression plasmid pG117ompPRhCT of the fusion protein PRhCT (Fig. 18), wherein hCT[G] was arranged following arginine at the 140-position from the N-terminus of the fusion protein PR (Fig. 4), was constructed. PCR was carried out by using pG97S4DhCT[G]R4 (Yabuta, M., Suzuki, Y. and Ohsuye, K. Appl. Microbiol. Biotechnol. 42: 703-708, 1995) as a template and P16: 5'-GCGGAGCTCCGCCTGTATCGCTGTGGTAACCTGAGCACCTG-3' and P17: 5'-CTGAGTCGACTTAGCCCGGG-3' as primers. The PCR product thus obtained was digested by SacI and SalI. The 0.1 kbp fragment (fragment 25) thus obtained was ligated to a 3.4 kbp fragment (fragment 26) obtained by digesting pG117ompPRR using SacI and SalI, and transformation was carried out. A plasmid was isolated from each clone thus obtained, and restriction enzyme analysis and nucleotide sequencing at the mutated site were performed so that it was identified as the expression plasmid of the target fusion protein. This plasmid was referred to as pG117ompPRhCT.

(3) Step 3: Preparation of fusion proteins PRhANP and PRhCT 

[0099] pG117ompPRhANP and pG117ompPRhCT prepared above were transformed into the OmpT protease-deficient E. coli strain W3110 M25 to thereby give fusion protein-producing strains. By cultivating these strains, the fusion proteins PRhANP (Fig. 16) and PRhCT (Fig. 18) were prepared as inclusion bodies.

Example 7 : Cleavage of PRhANP and PRhCT by OmpT protease 

[0100] It was examined whether or not the fusion proteins PRhANP (Fig. 16) and PRhCT (Fig. 18) could be cleaved by OmpT protease.

[0101] PRhANP and PRhCT were reacted with the purified OmpT protease specimen at 25°C for 30 minutes at pH 7.0. Fig. 19 shows the results of SDS-PAGE analysis, wherein - represents a lane free from OmpT protease and + represents a lane with the addition of OmpT protease. As a result, PRhANP was cleaved by OmpT protease (Fig. 19, lane α-hANP), while PRhCT was not cleaved thereby (Fig. 19, lane hCT).

[0102] Furthermore, in order to identify the cleavage site of PRhANP, the peptide digestion product after the OmpT protease treatment was isolated by HPLC and the N-terminal amino acid sequence was determined. Thus, it was confirmed that α-hANP had been excised.

[0103] Taking the results of this Example into consideration, it is obvious that α-hANP, having serine as the N-terminal amino acid, was excised from the fusion protein employed, while hCT[G], having cysteine as the N-terminal amino acid, was not cleaved. In the results of Example 3, PRC having cysteine at the +1-position could be cleaved by OmpT protease, but hCT[G] was not cleaved. Therefore, it has been confirmed that the cleavage by this enzyme does not depend merely on the amino acid sequence at the -1- and +1-positions of the cleavage site but is largely affected by the amino acid sequence in the vicinity of the cleavage site.

Example 8 : Preparation of fusion protein RShANP 

[0104] The fusion protein RShANP (Fig. 21) encoded by the expression plasmid pGRShANP (Fig. 20) is a fusion protein wherein β-gal97S, originating in 97 amino acids from the N-terminus of E. coli β-galactosidase and serving as a protective protein, is ligated to α-hANP via a linker consisting of three amino acids (glutamine-phenylalanine-arginine). In the course of studies on OmpT protease, the present inventors found that the fusion protein RShANP is cleaved by OmpT protease at the bond between arginine in the linker sequence and serine at the N-terminus of α-hANP. The fusion protein RShANP was prepared by the following two steps.

(1) Step 1: Construction of pGRShANP (Fig. 20)

[0105] pGHα97SII is a plasmid constructed as a β-gal97S/α-hANP fusion protein expression plasmid. A plasmid pGRShANP expressing the fusion protein RShANP, wherein the lysine located immediately before the N-terminal serine of α-hANP in the fusion protein expressed by pGHα97SII was converted into arginine, was constructed in the following manner.

[0106] PCR was carried out by using pGHα97SII as a template and P18: 5'-TACGATGCGCAATTCCGTAGCCTGCGG-3' and P19: 5'-TGCCTGACTGCGTTAGCAATTTAACTGTGAT-3' as primers. Thus, a 0.2 kbp PCR product (P20) was obtained, wherein the codon for the lysine located immediately before the N-terminal serine of α-hANP had been converted into a codon for arginine.

[0107] Then, PCR was carried out again by using the thus obtained PCR product (P20) and P21: 5'-TTATCGCCACTGGCAGCAGC-3' as primers and pGHα97SII as a template to thereby give a 1.0 kbp PCR product containing a linker DNA sequence with the substitution by arginine. This 1.0 kbp PCR product was digested by BglII and EcoRI and thus a 0.2 kbp DNA fragment (fragment 27) was isolated. The α-hANP expression plasmid pGHα97SII was digested by BglII and EcoRI and the 3.0 kbp fragment (fragment 28) thus obtained was ligated to the fragment 27, thereby constructing pGRShANP.

(2) Step 2: Preparation of fusion protein RShANP

[0108] When pGRShANP is expressed in E. coli, the fusion protein RShANP (Fig. 21) is expressed as an inclusion body. In a case wherein OmpT protease is expressed in E. coli, the inclusion body is cleaved by OmpT protease merely by solubilizing with urea. To avoid this phenomenon, pGRShANP was transformed into the OmpT protease-deficient E. coli strain W3110 M25 and thus the fusion protein RShANP was prepared as an inclusion body.

Example 9 : Cleavage of RShANP by OmpT protease 

[0109] The physiologically active peptide α-hANP was excised from the fusion protein RShANP (Fig. 21) in the following manner. RShANP was reacted with OmpT protease at pH 7.0 at 37°C for 2 hours, followed by SDS-PAGE analysis. The result is shown in Fig. 24 (lane RS), wherein - represents a lane free from OmpT protease and + represents a lane with the addition of OmpT protease. Based on this result, it was confirmed that RShANP could be cleaved by OmpT protease. To identify the cleavage site, the peptide digestion product obtained after the OmpT protease reaction was isolated by HPLC and the N-terminal amino acid sequence was determined. The OmpT protease cleavage site thus identified is listed in Table 3 (RShANP). As Table 3 shows, RShANP was cleaved by OmpT protease at -AQFR↓SLRR- and thus the physiologically active peptide α-hANP was directly excised. Also, cleavage at -AQFRSLR↓R- was partly detected and the excision of α-hANP(3-28) was also confirmed.



Table 3: OmpT protease cleavage sites of fusion proteins RShANP and RXhANP

RXhANP      Cleavage site
RShANP      QFR↓SLRRS (SEQ ID NO:52)
RRhANP      QFR↓RLRRS (SEQ ID NO:53)
RAhANP      QFR↓ALRRS (SEQ ID NO:54)
RChANP      QFR↓CLRRS (SEQ ID NO:55)

↓ stands for an OmpT protease cleavage site. The region from glutamine at the 99-position (from the N-terminus) to serine at the 106-position in the fusion protein RShANP or RXhANP is shown as the amino acid sequence at the cleavage site.



Example 10 : Preparation of fusion protein RXhANP 

[0110] In the case of using the fusion protein PRX as a substrate, it was confirmed that the amino acid sequence -RLYRXHHG- (wherein X represents an amino acid selected from the 20 types) having as X an amino acid other than aspartic acid and glutamic acid (i.e., acidic amino acids) and proline (i.e., an imino acid) was cleavable by OmpT protease. Thus, it was examined whether or not cleavage similar to that of PRX arose at an OmpT protease cleavage site having a different amino acid sequence. First, an expression plasmid pGRXhANP (wherein X is R, A or C) of the fusion protein RXhANP (wherein X is R, A or C) was constructed by introducing a mutation into the expression plasmid pGRShANP (Fig. 20) to thereby convert the OmpT protease cleavage site -AQFR↓SLRR- into -AQFRXLRR- (wherein X is arginine, alanine or cysteine) (Fig. 22). With respect to X, arginine was selected as a positive control forming a basic amino acid pair. Alanine and cysteine were selected because they provided relatively high cleavage efficiency in Example 3. RXhANP (Fig. 23) was constructed by the following two steps.

(1) Step 1: Construction of pGRXhANP (Fig. 22)

[0111] pGRXhANP was constructed by introducing a mutation into pGRShANP. pGRShANP was employed as a template, while P3: 5'-ACCCCAGGCTTTACACTTTA-3' and P22X: 5'-TCTCCGCAGNNNACGGAATTGCGCATCGTA-3' (NNN: AGC in the case of converting the serine into alanine; GCA in the case of converting it into cysteine; and ACG in the case of converting it into arginine) were employed as primers. From the PCR product thus obtained, the target PCR product was isolated (PCR product 29). Similarly, PCR was carried out by using pGRShANP as a template and P23X: 5'-CAATTCCGTNNNCTGCGGAGATCCAGCTGC-3' (NNN: GCT in the case of converting into alanine; TGC in the case of converting into cysteine; and CGT in the case of converting into arginine) and P24: 5'-GCCTGACTGCGTTAGCAATTTAACTGTGAT-3' as primers, and the target PCR product (PCR product 30) was isolated. PCR was then carried out by using the PCR products 29 and 30 obtained above as templates and P3: 5'-ACCCCAGGCTTTACACTTTA-3' and P24: 5'-GCCTGACTGCGTTAGCAATTTAACTGTGAT-3' as primers. The PCR product was collected and digested by EcoRI and BglII to thereby isolate a 0.2 kbp DNA fragment (fragment 31). This fragment 31 was ligated to a 3.0 kbp fragment (fragment 32), which had been obtained by digesting pGRShANP with EcoRI and BglII, and transformation was performed. A plasmid was isolated from each clone thus obtained, and restriction enzyme analysis and nucleotide sequencing at the mutated site were performed so that it was identified as the expression plasmid of the target fusion protein. This plasmid was referred to as pGRXhANP (wherein X represents the one-letter code of the amino acid introduced; for example, the expression plasmid of the fusion protein having the substitution into alanine is referred to as pGRAhANP).

(2) Step 2: Preparation of fusion protein RXhANP

[0112] When pGRXhANP is expressed in E. coli, the fusion protein RXhANP (Fig. 23) is expressed as an inclusion body. In a case where OmpT protease is expressed in E. coli, the inclusion body is cleaved by OmpT protease merely by dissolving with urea. To avoid this phenomenon, therefore, pGRXhANP was transformed into the OmpT protease-deficient E. coli strain W3110 M25 and thus the fusion protein RXhANP was prepared in the form of an inclusion body.
Example 11 : Cleavage of RXhANP by OmpT protease

[0113] The fusion protein RXhANP (Fig. 23), wherein the OmpT protease cleavage site -AQFR↓SLRR- of the fusion protein RShANP (Fig. 21) had been converted into -AQFRXLRR- (wherein X is arginine, alanine or cysteine), was treated with OmpT protease at pH 7.0 and 37°C for 2 hours. Fig. 24 shows the result of SDS-PAGE analysis.

[0114] Similarly to the cases of using PRR, PRA and PRC as substrates, cleavage by OmpT protease was observed in RRhANP, RAhANP and RChANP. To identify the cleavage sites, the peptide digestion products were isolated by HPLC after the OmpT protease treatment and the N-terminal amino acid sequences were determined. The OmpT protease cleavage sites thus identified are listed in Table 3. As Table 3 shows, RRhANP, RAhANP and RChANP were each cleaved by OmpT protease at -AQFR↓XLRR-. Furthermore, taking the results for the fusion protein PRX into consideration too, it is suggested that OmpT protease cleavage sites comprising one basic amino acid exist even in the case wherein the amino acid sequence in the vicinity of the OmpT protease cleavage site is altered.

[0115] In this Example, among the four sites (arginine-arginine, arginine-methionine, arginine-isoleucine and arginine-tyrosine) existing in the α-hANP molecule consisting of 28 amino acids, arginine-arginine alone was slightly cleaved, while the other bonds were scarcely cleaved. These facts suggest that the cleavage sequences reported hitherto and the sequences proposed by the present inventors (i.e., arginine-X or lysine-X wherein X is one of the 17 amino acids other than aspartic acid and glutamic acid (acidic amino acids) and proline (an imino acid)) are not, by themselves, sufficient for cleavage by OmpT protease. That is to say, it is indicated that a method for creating a novel cleavage site by utilizing a region containing the known cleavage sites of this enzyme, as performed by the present inventors, is industrially useful.

Example 12 : Preparation of fusion protein PRRXA 

[0116] As described above, OmpT protease cleaves the central bond of arginine-X or lysine-X wherein X is one of the 17 amino acids other than aspartic acid and glutamic acid (acidic amino acids) and proline (an imino acid). However, OmpT protease does not always cleave all of these bonds in proteins and peptides. It is presumed that the cleavage by OmpT protease is largely affected by the amino acid sequences in the vicinity of the cleavage sites. It is rather considered that this enzyme has high substrate specificity because it cleaves exclusively specific sites. The present inventors expected that the substrate specificity of this enzyme might be further clarified by examining the effect of the amino acid sequence on the N-terminal side of the known cleavage site of the enzyme and, therefore, conducted the following experiment.

[0117] The fusion protein PRR (shown in Fig. 25), having a structure cleavable by OmpT protease, consists of a protective protein (β-gal117S4H) originating in 117 amino acids from the N-terminus of E. coli β-galactosidase and human glucagon-like peptide-1 (GLP-1[G]). As shown in Fig. 25, fusion proteins PRRXA (wherein X corresponds to the position of the amino acid relative to the cleavage site and is represented by -1, -2, ..., -10, excluding -7), wherein the amino acids in the amino acid sequence at the -10- to -1-positions (i.e., GYDAELRLYR) of the OmpT protease cleavage site -QMHGYDAELRLYR↓RHHG- existing in the linker peptide of the fusion protein PRR had been converted one by one into alanine, were prepared and the cleavage of these fusion proteins by OmpT protease was examined.

[0118] The fusion protein PRRXA was prepared by the following two steps.
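
The one-by-one alanine substitution described in paragraph [0117] can be illustrated at the peptide level. The sketch below is an assumption-laden illustration, not the cloning procedure itself: the function name `alanine_scan` is hypothetical, and only the ten residues GYDAELRLYR (positions -10 to -1) are modelled.

```python
# Peptide-level sketch of the alanine scan over positions -10 ... -1.
# Function and variable names are illustrative assumptions.
WILD_TYPE = "GYDAELRLYR"  # -10 ... -1 of the PRR cleavage site


def alanine_scan(segment):
    """Map each position label (-len ... -1) to the segment with that residue
    replaced by alanine. Residues that are already alanine are skipped."""
    n = len(segment)
    variants = {}
    for i, residue in enumerate(segment):
        position = i - n  # -10 for the first residue, -1 for the last
        if residue == "A":
            continue  # nothing to substitute (the -7 position in PRR)
        variants[position] = segment[:i] + "A" + segment[i + 1:]
    return variants


variants = alanine_scan(WILD_TYPE)
for position in sorted(variants):
    print(f"PRR{position}A: {variants[position]}")
```

The scan yields nine variants and omits -7, since that residue is already alanine in the wild-type sequence.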

(Step 1) Construction of pG117ompPRRXA (Figs. 26, 27, 28, 29)

[0119] The expression plasmid of the fusion protein PRRXA was referred to as pG117ompPRRXA (corresponding to the fusion protein PRRXA). However, alanine at the -7-position was not substituted. These substitutions were carried out by introducing DNA mutations into pG117ompPRR.

[0120] The PCR method was employed in order to induce mutation. As shown in Fig. 26, the plasmids pG117ompPRR-2A, pG117ompPRR-3A and pG117ompPRR-4A were constructed by using pG117ompPRR as a template and P10: 5'-TGCCGAGGATGACGATGAGC-3', P25: 5'-GCGGAGCTCCGCCTGGCTCGCCGTCATCAC-3', P26: 5'-GCGGAGCTCCGCGCTTATCGCCGTCATCAC-3' and P27: 5'-GCGGAGCTCGCTCTGTATCGCCGTCATCAC-3' as primers. The PCR products obtained by using the combinations of the primers P10/P25, P10/P26 and P10/P27 were digested by SacI and KpnI to give a 0.1 kbp fragment (fragment 33) in each combination. Separately, pG117ompPRR was digested by BglII and SacI to give a 0.2 kbp fragment (fragment 34). These fragments 33 and 34 were ligated to a 3.2 kbp fragment (fragment 35) obtained by digesting pG117ompPRR by BglII and KpnI, and transformation was carried out. A plasmid was isolated from each clone thus obtained.
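
As a consistency check, the mutagenic primers can be translated in frame and compared with the wild-type site. The sketch below assumes that the reading frame begins at the first base of each primer and that the primers cover residues -7 to +3 of the site (AELRLYR↓RHH); the standard genetic code is used, and the function names are hypothetical.

```python
# Translate mutagenic primers and locate the substituted residue.
# Assumptions: frame starts at base 1; primers span positions -7 ... +3.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

WILD_TYPE = "AELRLYRRHH"  # positions -7 ... -1 followed by +1 ... +3


def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def site_label(index, first_position=-7):
    """Convert a 0-based residue index to a site position (there is no position 0)."""
    pos = first_position + index
    return pos if pos < 0 else pos + 1


def mutated_positions(primer):
    """List (position, wild-type residue, new residue) for each difference."""
    peptide = translate(primer)
    return [(site_label(i), w, m)
            for i, (w, m) in enumerate(zip(WILD_TYPE, peptide)) if w != m]


P26 = "GCGGAGCTCCGCGCTTATCGCCGTCATCAC"
P27 = "GCGGAGCTCGCTCTGTATCGCCGTCATCAC"

for name, primer in (("P26", P26), ("P27", P27)):
    print(name, translate(primer), mutated_positions(primer))
```

Under these assumptions P26 encodes the leucine-to-alanine change at the -3-position and P27 the arginine-to-alanine change at the -4-position, matching the plasmid names pG117ompPRR-3A and pG117ompPRR-4A.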

[0121] As shown in Fig. 27, expression plasmids pG117ompPRR-5A and pG117ompPRR-6A were constructed by using pG117ompPRR as a template and P10, P28: 5'-CAGATGCATGGTTATGACGCGGAGGCTCGC-3' and P29: 5'-CAGATGCATGGTTATGACGCGGCTCTCCGC-3' as primers. The PCR products obtained by using the combinations of the primers P10/P28 and P10/P29 were digested by NsiI and KpnI to give a 0.1 kbp fragment (fragment 36) in each combination. Separately, pG117ompPRR was digested by NsiI and KpnI to give a 3.4 kbp fragment (fragment 37). These fragments 36 and 37 were ligated together and transformation was carried out. A plasmid was isolated from each clone thus obtained.

[0122] Further, expression plasmids pG117ompPRR-8A, pG117ompPRR-9A and pG117ompPRR-10A were constructed as shown in Fig. 28. pG117ompPRR was used as a template and P3: 5'-ACCCCAGGCTTTACACTTTA-3', P30: 5'-GCGGAGCTCCGCAGCATAACCATGCATCTG-3', P31: 5'-GCGGAGCTCCGCGTCAGCACCATGCATCTG-3' and P32: 5'-GCGGAGCTCCGCGTCATAAGCATGCATCTG-3' were used as primers. The PCR products obtained by the combinations of the primers P3/P30, P3/P31 and P3/P32 were digested by SacI and BglII to give a 0.2 kbp fragment (fragment 38) in each combination. Separately, pG117ompPRR was digested by KpnI and SacI to give a 0.1 kbp fragment (fragment 39). These fragments 38 and 39 were ligated to a 3.2 kbp fragment (fragment 40) obtained by digesting pG117ompPRR with BglII and KpnI, and transformation was carried out. A plasmid was isolated from each clone thus obtained.

[0123] pG117ompPRR-1A was constructed as shown in Fig. 29 by using pG117ompPRR as a template and P10 and P33: 5'-GCGGAGCTCCGCCTGTATGCTCGTCATCAC-3' as primers. The PCR product obtained by the combination of the primers P10/P33 was digested by SacI to give a 0.1 kbp fragment (fragment 41). Separately, pG117ompPRR was digested by SacI to give a 3.4 kbp fragment (fragment 42). These fragments 41 and 42 were ligated together and transformation was carried out. A plasmid was isolated from each clone thus obtained.

[0124] The expression plasmids pG117ompPRRXA thus constructed were all subjected to restriction enzyme analysis and nucleotide sequencing at the mutation site and thus confirmed as being the expression plasmids of the target fusion proteins PRRXA.

(Step 2) Preparation of fusion proteins PRR and PRRXA 

[0125] When pG117ompPRR and pG117ompPRRXA are expressed in E. coli, the fusion proteins PRR and PRRXA are expressed as inclusion bodies. In a case where OmpT protease is expressed in E. coli, the inclusion body is cleaved by OmpT protease merely by dissolving with urea. To avoid the cleavage, therefore, pG117ompPRR and pG117ompPRRXA were transformed into the OmpT protease-deficient E. coli strain W3110 M25 and thus the fusion proteins PRR and PRRXA were prepared in the form of inclusion bodies.

Example 13 : Cleavage of fusion proteins PRR and PRRXA by OmpT protease

[0126] PRR and PRRXA were reacted with the purified OmpT protease specimen at 25°C for 60 minutes at pH 7.0. After the completion of the enzymatic reaction, SDS-PAGE analysis was carried out. The results are shown in Fig. 30.

[0127] PRR-1A was not cleaved by OmpT protease (Fig. 30, lane 1A). Although cleavage by OmpT protease was confirmed in the fusion proteins other than PRR-1A, the amount of the 4.9 kDa peptide fragment formed by the cleavage differed from protein to protein.

[0128] In order to quantitate the 4.9 kDa peptide fragment, the liquid reaction mixture of the above-described OmpT protease reaction was subjected to HPLC. In the OmpT protease reaction of each of the fusion proteins other than PRR-1A, a peak was detected at a retention time of 8.8 minutes. The N-terminal amino acid sequence of this peak at 8.8 minutes detected in PRR had been determined, and the peak had been identified as a 4.9 kDa peptide fragment formed by the cleavage at -QMHGYDAELRLYR↓RHHG- (Table 4). Thus, it is assumed that the peak at 8.8 minutes formed by reacting each fusion protein with OmpT protease corresponds to the 4.9 kDa peptide fragment.

[0129] The relative peak area at the retention time of 8.8 minutes indicates the amount of the 4.9 kDa peptide fragment formed by the cleavage. Table 4 shows the relative amounts of the 4.9 kDa peptide fragment calculated by taking the amount in the case of PRR as 100. Excluding PRR-1A, PRR-4A showed the smallest amount of the 4.9 kDa peptide fragment and PRR-6A showed the largest. Based on these results, it is considered that the -4- and -6-positions (in addition to the -1-position) largely affect the cleavage by OmpT protease.

Table 4: Cleavage of fusion protein PRRXA (X ranging from -10 to -1, excluding -7)

PRRXA       Relative amount of 4.9 kDa peptide
PRR         100
PRR-1A      ND
PRR-2A      150
PRR-3A      44
PRR-4A      24
PRR-5A      89
PRR-6A      330
PRR-8A      200
PRR-9A      160
PRR-10A     160

[0130] The 4.9 kDa peptide is formed by the cleavage of the fusion proteins by OmpT protease. The amount of the 4.9 kDa peptide of PRR is taken as 100. ND means not detectable.
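
The reading of Table 4 given in paragraph [0129] can be reproduced with a few lines. This is a minimal sketch using only the tabulated values; the dictionary and variable names are illustrative assumptions, and ND is represented as `None`.

```python
# Table 4 values keyed by the substituted position (PRR itself = 100).
# None stands for "ND" (not detectable).
relative_amount = {
    -1: None, -2: 150, -3: 44, -4: 24, -5: 89,
    -6: 330, -8: 200, -9: 160, -10: 160,
}

# Keep only the variants for which a 4.9 kDa peak was detected.
detected = {pos: amount for pos, amount in relative_amount.items()
            if amount is not None}

lowest = min(detected, key=detected.get)
highest = max(detected, key=detected.get)

print(f"smallest amount: position {lowest} ({detected[lowest]})")
print(f"largest amount:  position {highest} ({detected[highest]})")

# Fold change relative to the unsubstituted PRR (100).
for pos in sorted(detected, reverse=True):
    print(f"PRR{pos}A: {detected[pos] / 100:.2f}x")
```

Among the detected variants, the smallest value falls at the -4-position and the largest at the -6-position, which is why those two positions are examined further.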

Example 14 : Preparation of fusion proteins PRR-4X and PRR-6X 

[0131] OmpT protease is an endoprotease which recognizes and cleaves mainly basic amino acid pairs. The substrate specificity of mammalian furin, which is also an endoprotease recognizing and cleaving basic amino acid pairs (cleaving the C-terminal side of basic amino acid pairs), has been studied in detail, and it is reported that furin recognizes arginine at the -1-position and basic amino acids at the -2-, -4- and -6-positions relative to the cleavage site.

[0132] The results of Example 13 indicate that the substitution of arginine, which is the basic amino acid at the -4-position, into alanine makes the cleavage by OmpT protease difficult, while the substitution of glutamic acid, which is the acidic amino acid at the -6-position, into alanine facilitates the cleavage by OmpT protease.

[0133] Based on these facts, it was assumed that the cleavage by OmpT protease might be affected by the charges on the amino acids at the -6- and -4-positions of the cleavage site. Thus, the cleavage by OmpT protease was examined by forming fusion proteins wherein the amino acids at these positions were substituted by arginine and lysine (basic amino acids), aspartic acid and glutamic acid (acidic amino acids) and asparagine and glutamine (neutral amino acids being similar in structure to acidic amino acids).

[0134] A fusion protein with the substitution at the -4-position was referred to as PRR-4X (Fig. 31), while a fusion protein with the substitution at the -6-position was referred to as PRR-6X (Fig. 32), wherein X represents the one-letter code of the amino acid at the -4- or -6-position. The fusion proteins PRR-4X and PRR-6X were prepared by the following two steps.

(Step 1) Construction of pG117ompPRR-4X and pG117ompPRR-6X (Figs. 33 and 34)

[0135] The expression plasmid of the fusion protein PRR-4X, wherein arginine at the -4-position of the OmpT protease cleavage site -QMHGYDAELRLYR↓RHHG- of the fusion protein PRR had been substituted by lysine (a basic amino acid), aspartic acid or glutamic acid (an acidic amino acid) or asparagine or glutamine (a neutral amino acid being similar in structure to acidic amino acids), was referred to as pG117ompPRR-4X, while the expression plasmid of the fusion protein PRR-6X, wherein glutamic acid at the -6-position had been substituted in the same manner, was referred to as pG117ompPRR-6X. These substitutions were carried out by introducing DNA mutations into pG117ompPRR.

[0136] The mutations were introduced by PCR. Expression plasmids pG117ompPRR-4K, pG117ompPRR-4D, pG117ompPRR-4E, pG117ompPRR-4N and pG117ompPRR-4Q were constructed by the procedure shown in Fig. 33. pG117ompPRR was used as a template, while P10, P34: 5'-GCGGAGCTCAAACTGTATCGCCGTCATCAC-3', P35: 5'-GCGGAGCTCGACCTGTATCGCCGTCATCAC-3', P36: 5'-GCGGAGCTCGAACTGTATCGCCGTCATCAC-3', P37: 5'-GCGGAGCTCAACCTGTATCGCCGTCATCAC-3' and P38: 5'-GCGGAGCTCCAGCTGTATCGCCGTCATCAC-3' were used as primers. PCR with each of the primer combinations P10/P34, P10/P35, P10/P36, P10/P37 and P10/P38 gave a 0.3 kbp fragment (fragment 43). PCR was carried out again by using pG117ompPRR as a template and P3 and the fragment 43 as primers to give a 0.8 kbp fragment (fragment 44). The 0.1 kbp fragment (fragment 45) obtained by digesting fragment 44 with NsiI and KpnI was ligated to the 3.4 kbp fragment (fragment 46) obtained by digesting pG117ompPRR with NsiI and KpnI, and transformation was carried out. A plasmid was isolated from each clone thus obtained.

As shown in Fig. 34, expression plasmids pG117ompPRR-6R, pG117ompPRR-6K, pG117ompPRR-6D, pG117ompPRR-6N and pG117ompPRR-6Q were constructed by using pG117ompPRR as a template and P10, P39: 5'-CAGATGCATGGTTATGACGCGCGTCTCCGC-3', P40: 5'-CAGATGCATGGTTATGACGCGAAACTCCGC-3', P41: 5'-CAGATGCATGGTTATGACGCGGACCTCCGC-3', P42: 5'-CAGATGCATGGTTATGACGCGAACCTCCGC-3' and P43: 5'-CAGATGCATGGTTATGACGCGCAGCTCCGC-3' as primers. The PCR products obtained by using the combinations of the primers P10/P39, P10/P40, P10/P41, P10/P42 and P10/P43 were digested by NsiI and KpnI to give a 0.1 kbp fragment (fragment 46). Further, pG117ompPRR was digested by NsiI and KpnI to give the 3.4 kbp fragment (fragment 47). These fragments 46 and 47 were ligated together and transformation was carried out. A plasmid was isolated from each clone thus obtained, and restriction enzyme analysis and nucleotide sequencing at the mutated site were performed. Thus, these plasmids were identified as the expression plasmids pG117ompPRR-4X and pG117ompPRR-6X of the target fusion proteins PRR-4X and PRR-6X.

(Step 2) Preparation of fusion proteins PRR-4X and PRR-6X

[0137] When pG117ompPRR-4X and pG117ompPRR-6X are expressed in E. coli, the fusion proteins PRR-4X and PRR-6X (Figs. 31 and 32) are expressed as inclusion bodies. In a case where OmpT protease is expressed in E. coli, these inclusion bodies are cleaved by OmpT protease merely by dissolving with urea. To avoid the cleavage, therefore, pG117ompPRR-4X and pG117ompPRR-6X were transformed into the OmpT protease-deficient E. coli strain W3110 M25 and thus the fusion proteins PRR-4X and PRR-6X were prepared in the form of inclusion bodies.

Example 15 : Cleavage of fusion proteins PRR-4X and PRR-6X by OmpT protease 

[0138] PRR-4X was reacted with the purified OmpT protease specimen at 25°C for 60 minutes at pH 7.0. After the enzymatic reaction, SDS-PAGE analysis was performed. Fig. 35A shows the results, wherein - represents a lane free from OmpT protease and + represents a lane with the addition of OmpT protease (2.0 U/ml).

[0139] Although cleavage by OmpT protease was observed in all of the fusion proteins, the amount of the 4.9 kDa peptide fragment formed by the cleavage differed from protein to protein.

[0140] Then the 4.9 kDa peptide fragment was quantitated by using HPLC. In the cases of adding OmpT protease, peaks were detected at a retention time of 8.8 minutes. As described in Example 13, these peaks are seemingly assignable to the 4.9 kDa peptide fragment.

[0141] Table 5 shows the relative amounts of the 4.9 kDa peptide fragment calculated by taking the amount in the case of PRR as 100. The relative amounts of the peptide fragment formed by the cleavage were from 2 to 3% (PRR-4D and PRR-4E), from 20 to 60% (PRR-4A, PRR-4N and PRR-4Q) or almost comparable to PRR (PRR-4K). Based on these results, it is considered that OmpT protease recognizes the electrical charge of the amino acid at the -4-position.



Table 5: Cleavage of fusion protein PRR-4X (X being K, A, N, Q, D or E)

PRR-4X      Relative amount of 4.9 kDa peptide
PRR         100
PRR-4K      96
PRR-4A      49
PRR-4N      48
PRR-4Q      23
PRR-4D      2.8
PRR-4E      2.0

[0142] The 4.9 kDa peptide is formed by the cleavage of the fusion proteins by OmpT protease. The amount of the 4.9 kDa peptide of PRR is taken as 100. PRR has Arg at the -4-position.
[0143] As shown in Fig. 32, the cleavage by OmpT protease was further examined by using the fusion protein PRR-6X, wherein glutamic acid at the -6-position of the fusion protein PRR had been substituted. PRR-6X was reacted with the purified OmpT protease specimen at 25°C for 60 minutes at pH 7.0. After the enzymatic reaction, SDS-PAGE analysis was performed. Fig. 35B shows the results, wherein - represents a lane free from OmpT protease and + represents a lane with the addition of OmpT protease (0.1 U/ml).

[0144] Although cleavage by OmpT protease was observed in all of the fusion proteins, the amount of the 4.9 kDa peptide fragment formed by the cleavage differed from protein to protein, as in the case of PRR-4X.
[0145] Then the 4.9 kDa peptide fragment was quantitated by HPLC. Table 6 shows the relative amounts of the 4.9 kDa peptide fragment, calculated by taking the amount in the case of PRR as 100. The relative amounts of the peptide fragment formed by the cleavage were almost comparable to PRR (PRR-6D), about 3 to 4 times as much as PRR (PRR-6A, PRR-6N and PRR-6Q) or about 10 times as much as PRR (PRR-6R and PRR-6K). Based on these results, it is considered that OmpT protease also recognizes the electrical charge of the amino acid at the -6-position.

Table 6

Cleavage of fusion protein PRR-6X (X being R, K, A, N, Q or D)

PRR-6X      Relative amount of 4.9 kDa peptide
PRR         100
PRR-6R      1000
PRR-6K      1400
PRR-6A      310
PRR-6N      430
PRR-6Q      390
PRR-6D      110

[0146] The 4.9 kDa peptide is formed by the cleavage of the fusion proteins by OmpT protease. The amount of the 4.9 kDa peptide of PRR is taken as 100. PRR has Glu at the -6-position.

[0147] These results suggest that OmpT protease recognizes the amino acids at the -4- and -6-positions relative to the cleavage site of the substrate, and that the cleavage ratio is elevated when the amino acids at these positions are basic amino acids but lowered when they are acidic amino acids.
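The trend stated in [0147] can be checked directly against the values in Tables 5 and 6. The following is a minimal sketch: the numbers are copied from the two tables, while the grouping into "basic", "neutral" and "acidic" classes and the function name `mean_by_charge` are illustrative assumptions, not part of the described method.

```python
# Relative amounts of the 4.9 kDa peptide (PRR = 100), copied from Tables 5 and 6.
# Keys are the one-letter codes of the residue at the substituted position.
TABLE_5_MINUS4 = {"R": 100, "K": 96, "A": 49, "N": 48, "Q": 23, "D": 2.8, "E": 2.0}
TABLE_6_MINUS6 = {"E": 100, "R": 1000, "K": 1400, "A": 310, "N": 430, "Q": 390, "D": 110}

# Assumed charge classes (illustrative grouping only).
CHARGE_CLASS = {"R": "basic", "K": "basic", "D": "acidic", "E": "acidic"}


def mean_by_charge(table):
    """Average the relative amounts within each charge class."""
    groups = {}
    for residue, amount in table.items():
        charge = CHARGE_CLASS.get(residue, "neutral")
        groups.setdefault(charge, []).append(amount)
    return {charge: sum(values) / len(values) for charge, values in groups.items()}


if __name__ == "__main__":
    for name, table in (("-4", TABLE_5_MINUS4), ("-6", TABLE_6_MINUS6)):
        print(name, mean_by_charge(table))
```

At both positions the class averages fall in the order basic, neutral, acidic, which is the ordering the paragraph describes.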

Example 16 : Application to sequence cleavable by OmpT protease

[0148] Based on the results of Example 15, it is assumed that the cleavage efficiency of OmpT protease can be elevated by converting the amino acids at the -4- and -6-positions of a known OmpT protease cleavage site into basic amino acids. From this viewpoint, a fusion protein RShANPR (Fig. 36) was formed by substituting arginine (a basic amino acid) for the amino acids at the -4- and -6-positions of the OmpT protease cleavage site of the fusion protein RShANP (Fig. 21), which has a structure cleavable by OmpT protease and thus releases α-hANP. It was then examined by the following three steps whether or not these fusion proteins differed from each other in cleavage by OmpT protease.
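The substitution described in [0148] can be sketched as a simple sequence edit. The example below uses the RShANP cleavage-site sequence -QMHGYDAQFRSLRR- given in [0149]; the function name `substitute_upstream` and its parameters are hypothetical and serve only to illustrate how the -4- and -6-positions are counted back from the cleaved bond.

```python
def substitute_upstream(site, plus1_index, new_residue="R", positions=(-4, -6)):
    """Return `site` with the residues at the given positions replaced.

    site        -- one-letter amino acid sequence around the cleavage site
    plus1_index -- 0-based index of the +1 residue (first residue after the bond)
    positions   -- negative positions relative to the bond; -1 is the residue
                   immediately before the cleaved bond
    """
    residues = list(site)
    for pos in positions:
        idx = plus1_index + pos
        if pos >= 0 or idx < 0:
            raise ValueError(f"position {pos} is not available in this sequence")
        residues[idx] = new_residue
    return "".join(residues)


RSHANP_SITE = "QMHGYDAQFRSLRR"            # cleaved between ...FR and SLRR...
PLUS1 = RSHANP_SITE.index("SLRR")         # index of Ser(+1)

if __name__ == "__main__":
    print(substitute_upstream(RSHANP_SITE, PLUS1))  # Ala(-4) and Tyr(-6) become Arg
```

Counting back from Ser(+1), the -4 residue is alanine and the -6 residue is tyrosine, so the edited sequence reads QMHGRDRQFRSLRR.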

(Step 1) Construction of pGRShANPR (Fig. 37)

[0149] The expression plasmid of the fusion protein RShANPR, wherein alanine at the -4-position and tyrosine at the -6-position of the OmpT protease cleavage site -QMHGYDAQFR↓SLRR- of the fusion protein RShANP had each been substituted by arginine, was referred to as pGRShANPR. These conversions were carried out by introducing DNA mutations into pGRShANP.

[0150] The DNA mutations were introduced by PCR. As shown in Fig. 37, pGRShANP was used as a template and P10 and P44: 5'-ATGCACGGTCGTGATCGTCAATTCCGTAGC-3' were used as primers. As the product of PCR with the primer combination P10/P44, a 0.3 kbp fragment (fragment 48) was obtained. Then PCR was carried out again by using pGRShANP as a template and P3 and fragment 48 as primers to give a 0.6 kbp fragment (fragment 49). The 0.2 kbp fragment (fragment 50) obtained by digesting fragment 49 with BglII and EcoRI was ligated to the 3.0 kbp fragment (fragment 51) obtained by digesting pGRShANP with BglII and EcoRI, and transformation was carried out. A plasmid was isolated from each clone thus obtained, and restriction enzyme analysis and nucleotide sequencing at the mutated site were performed, so that it was identified as the expression plasmid pGRShANPR of the target fusion protein RShANPR.
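As a quick consistency check on the mutagenic primer, P44 can be translated in its 5' to 3' reading frame. This is a minimal sketch that assumes the primer is read in frame from its first base and uses the standard genetic code; the helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a DNA sequence in frame 1 into one-letter amino acid codes."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3      # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


P44 = "ATGCACGGTCGTGATCGTCAATTCCGTAGC"   # primer sequence quoted in [0150]

if __name__ == "__main__":
    print(translate(P44))
```

The translation, MHGRDRQFRS, differs from the corresponding stretch of the RShANP site (MHGYDAQFRS) only at the tyrosine and alanine positions, both of which read as arginine (codon CGT).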

(Step 2) Preparation of fusion proteins RShANP and RShANPR 

[0151] When pGRShANP and pGRShANPR are expressed in E. coli, the fusion proteins RShANP and RShANPR (Fig. 36) are expressed as inclusion bodies. In a case where OmpT protease is expressed in E. coli, these inclusion bodies are cleaved by OmpT protease merely by dissolving them with urea. To avoid the cleavage, therefore, pGRShANP and pGRShANPR were transformed into the OmpT protease-deficient E. coli strain W3110 M25, and thus the fusion proteins RShANP and RShANPR were prepared in the form of inclusion bodies.


(Step 3) Cleavage of fusion proteins RShANP and RShANPR by OmpT protease 

[0152] RShANP and RShANPR were reacted with the purified OmpT protease specimen at 25°C for 90 minutes at pH 7.0. After the completion of the enzymatic reaction, the peptide fragment thus released was quantitated by HPLC. In the case of adding OmpT protease (2.0 U/ml), a peak was detected at a retention time of 4.7 minutes. By isolating this peak and determining the N-terminal amino acid sequence, it was identified as α-hANP.
[0153] The relative peak area at the retention time of 4.7 minutes (i.e., the relative amount of the released α-hANP) of RShANPR was 2.2 times as much as that of RShANP. Based on these results, it is expected that the cleavage efficiency of OmpT protease can be elevated by converting the amino acids at the -6- and -4-positions of a known OmpT protease cleavage site into basic amino acids (arginine in the above case).

EFFECTS OF THE INVENTION 

[0154] In one aspect of the method according to the present invention, use is made of the property of OmpT protease that it shows a highly specific effect of cleaving exclusively the bonds between arginine-X and lysine-X (wherein X represents an amino acid other than glutamic acid, aspartic acid or proline) located in specific amino acid sequences. Therefore, use of OmpT protease makes it possible, for example, to select a peptide having an amino acid other than glutamic acid, aspartic acid or proline as its N-terminal amino acid when excising the target peptide from a fusion protein expressed by genetic engineering techniques, and to avoid cleavage at undesired peptide bonds by converting the amino acid at the +1-position into glutamic acid, aspartic acid or proline.
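The -1/+1 rule in [0154] can be written as a small predicate. The sketch below is only a screening aid under stated assumptions: it flags bonds that satisfy the arginine-X or lysine-X condition, whereas the text notes that actual cleavage also depends on the surrounding amino acid sequence. The names `bond_matches_rule` and `candidate_bonds` are hypothetical.

```python
BASIC_MINUS1 = {"R", "K"}          # arginine or lysine at the -1-position
BLOCKING_PLUS1 = {"E", "D", "P"}   # glutamic acid, aspartic acid or proline at +1


def bond_matches_rule(minus1, plus1):
    """True if the bond between `minus1` and `plus1` satisfies the -1/+1 rule."""
    return minus1 in BASIC_MINUS1 and plus1 not in BLOCKING_PLUS1


def candidate_bonds(seq):
    """Return the 0-based indices of +1 residues whose preceding bond matches the rule."""
    return [i for i in range(1, len(seq)) if bond_matches_rule(seq[i - 1], seq[i])]


if __name__ == "__main__":
    site = "QMHGYDAQFRSLRR"        # RShANP cleavage-site sequence from [0149]
    print(candidate_bonds(site))   # bonds R-S and R-R both satisfy the rule
```

Replacing the +1 residue with D, E or P removes a bond from the candidate list, which corresponds to the way of avoiding undesired cleavage described above.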

[0155] In another aspect of the method according to the present invention, use is made of another property of OmpT protease, namely that it recognizes the charges of the amino acids at the -6- and -4-positions. In the case of excising the target peptide from a fusion protein expressed by genetic engineering techniques, as in the above-described case, the cleavage ratio can be elevated by converting the amino acids at the -6- and -4-positions into basic amino acids, and cleavage at undesired peptide bonds can be minimized by converting the amino acids at the -6- and -4-positions into acidic amino acids. When a fusion protein is expressed as inclusion bodies, OmpT protease is recovered together with the inclusion bodies. Therefore, the present invention is particularly effective in the case of using E. coli as a host.
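The charge dependence at the -6- and -4-positions can be illustrated with a qualitative lookup. This sketch assumes a three-way classification (basic, neutral, acidic) and returns labels only; it is not a quantitative model of cleavage efficiency, and the function name `upstream_charges` is hypothetical. The example sequence is the PRR linker region (Gly Tyr Asp Ala Glu Leu Arg Leu Tyr Arg Arg His His Gly), which has Arg at the -4-position and Glu at the -6-position.

```python
def charge_class(residue):
    """Classify a one-letter residue as basic, acidic or neutral (illustrative grouping)."""
    if residue in "RK":
        return "basic"
    if residue in "DE":
        return "acidic"
    return "neutral"


def upstream_charges(seq, plus1_index, positions=(-4, -6)):
    """Report the charge class at each upstream position relative to the cleaved bond.

    plus1_index is the 0-based index of the +1 residue; positions outside the
    sequence are reported as None.
    """
    report = {}
    for pos in positions:
        idx = plus1_index + pos
        report[pos] = charge_class(seq[idx]) if 0 <= idx < len(seq) else None
    return report


PRR_LINKER = "GYDAELRLYRRHHG"   # cleaved between Arg(-1) and Arg(+1)
PRR_PLUS1 = 10                  # index of Arg(+1)

if __name__ == "__main__":
    print(upstream_charges(PRR_LINKER, PRR_PLUS1))
```

For PRR this reports a basic residue at -4 and an acidic residue at -6, consistent with the substitution at -6 raising the cleavage amount in Table 6.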

Claims 


1. A method of controlling cleavage of a polypeptide by OmpT protease which comprises converting a sequence site consisting of two arbitrary consecutive amino acids and/or amino acid(s) in the vicinity of said site in said polypeptide into other amino acids, characterized by (1) setting lysine or arginine as the amino acid at the -1-position concerning said site and setting a specific amino acid as the amino acid at the +1-position; and/or (2) setting specific amino acid(s) as the amino acid(s) at the -4-position and/or the -6-position relative to said site; so that a desired part of said polypeptide is cleaved by OmpT protease and/or an undesired part of said polypeptide is not (or hardly) cleaved by OmpT protease.

2. The method as claimed in claim 1 characterized by, in a case where the amino acid at the -1-position of the sequence site consisting of two arbitrary consecutive amino acids in said polypeptide is neither lysine nor arginine, converting said amino acid into lysine or arginine and setting an amino acid X (wherein X is an amino acid other than glutamic acid, aspartic acid, proline, arginine, lysine, alanine, methionine or valine) as the amino acid at the +1-position so that a desired part of the polypeptide is cleaved by OmpT protease.
3. The method as claimed in claim 1 which comprises, in a polypeptide having a sequence site consisting of two arbitrary consecutive amino acids cleavable by OmpT protease, converting the amino acids at said site and/or in the vicinity thereof into other amino acids, characterized by (1) setting lysine or arginine as the amino acid at the -1-position of said site and setting a specific amino acid as the amino acid at the +1-position; and/or (2) setting specific amino acid(s) as the amino acid(s) at the -4-position and/or the -6-position relative to said site; so that a desired part of said polypeptide is cleavable by OmpT protease.

4. The method as claimed in claim 3 which comprises, in a polypeptide having a sequence site consisting of two arbitrary consecutive amino acids cleavable by OmpT protease, converting the amino acids at said site and/or in the vicinity thereof into other amino acids, characterized by (1) setting lysine or arginine as the amino acid at the -1-position of said site and setting an amino acid X (wherein X is an amino acid other than glutamic acid, aspartic acid or proline) as the amino acid at the +1-position; and/or (2) setting an amino acid X (wherein X is an amino acid other than acidic amino acids) as the amino acid(s) at the -4-position and/or the -6-position in said sequence.

5. A method as claimed in claim 1, 3 or 4 characterized in that the amino acid(s) at the -4-position and/or the -6-position are basic amino acid(s).

6. The method as claimed in claim 5 characterized in that the amino acid(s) at the -4-position and/or the -6-position are lysine or arginine.


7. The method as claimed in claim 1 which comprises, in a polypeptide having a sequence site consisting of two arbitrary consecutive amino acids cleavable by OmpT protease, converting the amino acids at said site and/or in the vicinity thereof into other amino acids, characterized by (1) setting lysine or arginine as the amino acid at the -1-position of said site and setting a specific amino acid as the amino acid at the +1-position; and/or (2) setting specific amino acid(s) as the amino acid(s) at the -4-position and/or the -6-position relative to said site; so that undesired parts of said polypeptide are not (or hardly) cleaved by OmpT protease.

8. The method as claimed in claim 7 which comprises, in a polypeptide having a sequence site consisting of two arbitrary consecutive amino acids cleavable by OmpT protease, converting the amino acids at said site and/or in the vicinity thereof into other amino acids, characterized by (1) setting lysine or arginine as the amino acid at the -1-position of said site and setting an amino acid X (wherein X is glutamic acid, aspartic acid or proline) as the amino acid at the +1-position; and/or (2) setting acidic amino acid(s) as the amino acid(s) at the -4-position and/or the -6-position in said sequence.

9. A method as claimed in any of claims 1 to 8 comprising expressing in host cells a gene, which encodes a fusion protein consisting of a target polypeptide fused with a protective peptide via a cleavage site (optionally located in a linker peptide) and being cleavable by OmpT protease at said cleavage site, and cleaving off the protein at said cleavage site by OmpT protease to thereby obtain the target polypeptide from said fusion protein.

10. The method as claimed in claim 9 wherein an amino acid sequence cleavable by OmpT protease exists in the amino acid sequences of the protective peptide, the linker peptide and/or the target polypeptide constituting said fusion protein.

11. A method of producing a target polypeptide which comprises expressing in host cells a gene, which encodes a fusion protein consisting of a target polypeptide fused with a protective peptide via a cleavage site (optionally located in a linker peptide) and being cleavable by OmpT protease at said cleavage site, and cleaving the protein at said cleavage site by OmpT protease to thereby obtain the target polypeptide from said fusion protein, characterized by converting the amino acids at said cleavage site and/or amino acid(s) in the vicinity of said site as described in any of claims 1 to 8.


12. The method as claimed in claim 11 wherein an amino acid sequence cleavable by OmpT protease exists in the amino acid sequences of the protective peptide, the linker peptide and/or the target polypeptide constituting said fusion protein.

13. A method as claimed in any of claims 1, 7 and 8 wherein a gene encoding a polypeptide is expressed in host cells so that said polypeptide is cleavable at an undesired part by OmpT protease.

14. A method of producing a polypeptide which comprises expressing a gene encoding the polypeptide in host cells, characterized in that when said polypeptide is cleaved by OmpT protease at undesired parts, amino acid(s) are converted as described in any of claims 1 to 8.

15. A method as claimed in any of claims 9 to 14 wherein the host cell is Escherichia coli.


16. A method as claimed in any of claims 9 to 15 wherein said target polypeptide is a natriuretic peptide. 



Fig. 1: Plasmid construction scheme for pG117S4HR6GLP-1. Labels: pG117S4HGP; StuI digestion; 5'-phosphorylation; R6 synthetic DNA (SEQ ID NOs: 74 and 75); amino acid sequence in the R6 moiety (SEQ ID NO: 61) with the OmpT protease cleavage site marked; pG117S4HR6GLP-1. Legend: β-gal117S4H, PSRHKR, GLP-1[G], R6, Tcr.
Fig. 2: Plasmid construction scheme for pG117S4HompRHKR. Labels: pG117S4HR6GLP-1; NsiI-HindIII digestion; BamHI-HindIII digestion; 3.2 kbp (Fragment 1); 0.2 kbp (Fragment 2); 5'-phosphorylation; L1 synthetic DNA (SEQ ID NOs: 76 and 77); amino acid sequence in the L1 moiety (SEQ ID NO: 62) with the OmpT protease cleavage site marked; ligation; pG117S4HompRHKR. Legend: β-gal117S4H, PSRHKR, GLP-1[G], R6, L1 synthetic DNA.
Fig. 3: Plasmid construction scheme starting from pG117S4HompRHKR. Labels: primers P1 (SEQ ID NO: 3) and P2 (SEQ ID NO: 4); PCR with primers P1 and P2 (conversion of PSRHKR into PSRHPR); HindIII-SphI digestion; BglII-SphI digestion; Fragments 3, 4 and 5; ligation. Legend: β-gal117S4H, PSRHKR, GLP-1[G], R6, Tcr, L1 synthetic DNA, PSRHPR, linker peptide.
Fig. 4: Amino acid sequence (residue numbers at right).

Met Thr Met Ile Thr Asp Ser Leu Ala Val Val Leu Gln Arg Lys    15
Asp Trp Glu Asn Pro Gly Val Thr Gln Leu Asn Arg Leu Ala Ala    30
His Pro Pro Phe Ala Ser Trp Arg Asn Ser Asp Asp Ala Arg Thr    45
Asp Arg Pro Ser Gln Gln Leu Arg Ser Leu Asn Gly Glu Trp Arg    60
Phe Ala Trp Phe Pro Ala Pro Glu Ala Val Pro Glu Ser Leu Leu    75
Asp Leu Pro Glu Ala Asp Thr Val Val Val Pro Asp Ser Ser Asn    90
Trp Gln Met His Gly Tyr Asp Ala Pro Ile Tyr Thr Asn Val Thr   105
Tyr Pro Ile Thr Val Asn Pro Pro Phe Val Pro Thr Glu Pro His   120
His His His Pro Gly Gly Arg Gln Met His Gly Tyr Asp Ala Glu   135
Leu Arg Leu Tyr Arg Arg His His Gly Ser Gly Ser Pro Ser Arg   150
His Pro Arg His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser   165
Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val   180
Lys Gly Arg Gly
Fig. 5: Plasmid construction scheme starting from pG117S4HompRHPR. Labels: primers P3 (SEQ ID NO: 6) and P4X (SEQ ID NO: 7; BamHI site and codon corresponding to amino acid X); PCR with primers P3 and P4X; PvuI-BamHI digestion; Fragment 6; primers P5 (SEQ ID NO: 8) and P6 (SEQ ID NO: 9); PCR with primers P5 and P6; BamHI-HindIII digestion; Fragment 7; PvuI-HindIII digestion; Fragment 8; ligation. Legend: β-gal117S4H, linker peptide, GLP-1[G].
Fig. 6: Structure of the fusion protein: β-gal117S4H, linker peptide (residues 128 to 153, QMHGYDAELRLYRXHHGSGSPYRHPR) and GLP-1[G] (residues 154 to 184), with the OmpT protease cleavage site between residues 140 and 141.
Fig. 7: Plasmid construction scheme. Labels: introduction of EcoRI and SalI restriction enzyme sites by site-specific mutagenesis; EcoRI-SalI digestion; Fragment 9; AlwNI-SalI digestion; 2.3 kbp (Fragment 11); primers P7 (SEQ ID NO: 10) and P8 (SEQ ID NO: 11); PCR with primers P7 and P8; AlwNI-EcoRI digestion; 0.5 kbp (Fragment 10); ligation. Legend: β-gal117S4H, linker peptide, GLP-1[G], OmpT.
Fig. 8: Plasmid construction scheme starting from pOmpTTc. Labels: HincII-MfeI digestion; Fragments 12 and 13; primers P9 (SEQ ID NO: 12) and P10 (SEQ ID NO: 13); PCR with primers P9 and P10 (adjusting the distance from the SD sequence to the OmpT protease translation initiation site to 9 bases); EcoRI-HincII digestion; Fragment 14; ligation. Legend: OmpT, Tcr.
Fig. 9: Plasmid construction scheme starting from pOmpTTc. Labels: EcoRI-SalI digestion; 1.3 kbp (Fragment 15); Fragment 16; ligation; primers P11 (SEQ ID NO: 14) and P12 (SEQ ID NO: 15); PCR with P11 and P12 (modifying bases in the secondary mRNA structure from the transcription initiation site to the codon of the fifth amino acid following the OmpT protease translation initiation, in order to minimize stem and loop formation); EcoRI-AlwNI digestion; Fragment 17. Legend: OmpT, Tcr.
Fig. 10: Plasmid construction scheme. Labels: 1.3 kbp (Fragment 18); Fragment 19; ligation. Legend: OmpT.
Fig. 11: Surviving lane labels (one-letter codes): T, C, Y, N, Q, D, E, K, R, H.
Fig. 12: Plasmid construction scheme involving pG117ompPRR. Labels: primers P3 and P13X (SEQ ID NO: 37; BamHI site and codon corresponding to amino acid X); PCR with primers P3 and P13X; PvuI-BamHI digestion; BamHI-SalI digestion; PvuI-SalI digestion; Fragments 20 and 21; ligation. Legend: β-gal117S4H, linker peptide, GLP-1[G].
Fig. 13: Structure of the fusion protein: β-gal117S4H, linker peptide (residues 128 to 153, QMHGYDAELRLYKXHHGSGSPYRHPR) and GLP-1[G] (residues 154 to 184), with the OmpT protease cleavage site between residues 140 and 141.
Fig. 14: Gel image. Labels: kDa; peptide fragment; lanes without (-) and with (+) OmpT protease for KA, KS, KK, KR, KD and KE.
Fig. 15: Plasmid construction scheme. Labels: pGHα97SII; primers P14 (SEQ ID NO: 44) and P15 (SEQ ID NO: 45); PCR with primers P14 and P15; SacI-SalI digestion; Fragment 23; pG117ompPRR; SacI-SalI digestion; 3.4 kbp (Fragment 24); ligation. Legend: β-gal117S4H, β-gal97S, GLP-1[G], α-hANP, Tcr, linker peptide 1, linker peptide 2, linker peptide 3.






EP1076097A1 



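Fig. 16: Structure of the fusion protein: β-gal117S4H, linker peptide and α-hANP (sequence following β-gal117S4H: QMHGYDAELRLYRSLRRSSCFGGRMDRIGAQSGLGCNSFRY), with the OmpT protease cleavage site between residues 140 and 141.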
Fig. 17: Plasmid construction scheme. Labels: pG97S4DhCT[G]R4; primers P16 (SEQ ID NO: 46) and P17 (SEQ ID NO: 47); PCR with primers P16 and P17; SacI-SalI digestion; pG117ompPRR; SacI-SalI digestion.
Fig. 18: Structure of the fusion protein: linker peptide and hCT[G], with the OmpT protease cleavage site marked.
Fig. 19
Fig. 20: Plasmid construction scheme starting from pGHα97SII. Labels: primers P18 (SEQ ID NO: 48; codon corresponding to arginine) and P19 (SEQ ID NO: 49); PCR with primers P18 and P19; PCR product P20 (0.2 kbp); primer P21 (SEQ ID NO: 50); PCR with PCR product P20 and primer P21; 1.0 kbp; BglII-EcoRI digestion; 0.2 kbp (Fragment 27); Fragment 28; ligation. Legend: β-gal97S, linker peptide 1, α-hANP, linker peptide 2.






Fig. 21: Amino acid sequence (residue numbers at right).

Met Thr Met Ile Thr Asp Ser Leu Ala Val Val Leu Gln Arg Arg    15
Asp Trp Glu Asn Pro Gly Val Thr Gln Leu Asn Arg Leu Ala Ala    30
His Pro Pro Phe Ala Ser Trp Arg Asn Ser Glu Glu Ala Arg Thr    45
Asp Arg Pro Ser Gln Gln Leu Arg Ser Leu Asn Gly Glu Trp Arg    60
Phe Ala Trp Phe Pro Ala Pro Glu Ala Val Pro Glu Ser Leu Leu    75
Glu Leu Pro Glu Ala Asp Thr Val Val Val Pro Asp Ser Ser Asn    90
Trp Gln Met His Gly Tyr Asp Ala Gln Phe Arg Ser Leu Arg Arg   105
Ser Ser Cys Phe Gly Gly Arg Met Asp Arg Ile Gly Ala Gln Ser   120
Gly Leu Gly Cys Asn Ser Phe Arg Tyr
Fig. 22: PCR scheme. Labels: primers P3, P22 (SEQ ID NO: 57), P23X (SEQ ID NO: 58; codon corresponding to amino acid X) and P24 (SEQ ID NO: 59); PCR with primers P3 and P22; PCR with primers P23X and P24; PCR product 29; PCR with primers P3 and P24; PCR product 30; BglII and EcoRI sites. Legend: β-gal97S, linker peptide, modified α-hANP.
Fig. 23: Structure of the fusion protein: linker peptide and modified α-hANP (to residue 129), with the OmpT protease cleavage site between residues 101 and 102.
Fig. 24: Gel image. Molecular weight markers (kDa): 200, 97, 55, 36, 14, 6, 3.5. Band labels: fusion proteins RShANP and RXhANP; α-hANP and α-hANP derivative. Lanes: RS, RR, RA, RC.
Fig. 25: Linker peptide sequences of PRR and its alanine-substituted derivatives (β-gal117S4H, linker peptide, GLP-1[G]); OmpT protease cleaves between the -1- and +1-positions.

Position  -10 -9  -8  -7  -6  -5  -4  -3  -2  -1  +1
PRR       Gly Tyr Asp Ala Glu Leu Arg Leu Tyr Arg Arg His His Gly
PRR-1A    Gly Tyr Asp Ala Glu Leu Arg Leu Tyr Ala Arg His His Gly
PRR-2A    Gly Tyr Asp Ala Glu Leu Arg Leu Ala Arg Arg His His Gly
PRR-3A    Gly Tyr Asp Ala Glu Leu Arg Ala Tyr Arg Arg His His Gly
PRR-4A    Gly Tyr Asp Ala Glu Leu Ala Leu Tyr Arg Arg His His Gly
PRR-5A    Gly Tyr Asp Ala Glu Ala Arg Leu Tyr Arg Arg His His Gly
PRR-6A    Gly Tyr Asp Ala Ala Leu Arg Leu Tyr Arg Arg His His Gly
PRR-8A    Gly Tyr Ala Ala Glu Leu Arg Leu Tyr Arg Arg His His Gly
PRR-9A    Gly Ala Asp Ala Glu Leu Arg Leu Tyr Arg Arg His His Gly
PRR-10A   Ala Tyr Asp Ala Glu Leu Arg Leu Tyr Arg Arg His His Gly
Fig. 26: Plasmid construction scheme starting from pG117ompPRR. Labels: primers P10 (SEQ ID NO: 13), P25 (SEQ ID NO: 80), P26 (SEQ ID NO: 81) and P27 (SEQ ID NO: 82); PCR with primers P10, P25, P26 and P27; SacI-KpnI digestion; Fragment 35; ligation. Legend: β-gal117S4H, linker peptide, GLP-1[G], Tcr.
Fig. 27: Plasmid construction scheme starting from pG117ompPRR. Labels: primers P10 (SEQ ID NO: 13), P28 (SEQ ID NO: 83) and P29 (SEQ ID NO: 84); PCR with primers P10, P28 and P29; NsiI-KpnI digestion; 0.1 kbp (Fragment 36); 3.4 kbp (Fragment 37); ligation. Legend: β-gal117S4H, linker peptide, GLP-1[G].
Fig. 28: Plasmid construction scheme starting from pG117ompPRR. Labels: primers P3 (SEQ ID NO: 6), P30 (SEQ ID NO: 85), P31 (SEQ ID NO: 86) and P32 (SEQ ID NO: 87); PCR with primers P3, P30, P31 and P32; SacI-BglII digestion; 0.2 kbp (Fragment 38); KpnI-SacI digestion; 0.1 kbp (Fragment 39); BglII-KpnI digestion; Fragment 40. Legend: β-gal117S4H, linker peptide, GLP-1[G], Tcr.
Fig. 29: Plasmid construction scheme starting from pG117ompPRR. Labels: primers P10 (SEQ ID NO: 13) and P33 (SEQ ID NO: 88); PCR with primers P10 and P33; SacI digestion; 0.1 kbp (Fragment 41); 3.4 kbp (Fragment 42); ligation. Legend: β-gal117S4H, linker peptide, GLP-1[G].
Fig. 30: Gel image (kDa). Surviving lane labels: PRR, -1A, -3A, -4A, -5A (first panel); PRR, -6A, -8A, -9A, -10A (second panel).
Fig. 31: Fusion proteins PRR-4X (β-gal117S4H, linker peptide, GLP-1[G]). Linker sequence of PRR from the -10-position: Gly Tyr Asp Ala Glu Leu Arg Leu Tyr Arg Arg His His Gly, with OmpT protease cleaving between Arg(-1) and Arg(+1). Amino acid at the -4-position:

PRR       Arg
PRR-4K    Lys
PRR-4A    Ala
PRR-4N    Asn
PRR-4Q    Gln
PRR-4D    Asp
PRR-4E    Glu
Fig. 32: Fusion proteins PRR-6X (β-gal117S4H, linker peptide, GLP-1[G]). Linker sequence of PRR from the -10-position: Gly Tyr Asp Ala Glu Leu Arg Leu Tyr Arg Arg His His Gly, with OmpT protease cleaving between Arg(-1) and Arg(+1). Amino acid at the -6-position:

PRR       Glu
PRR-6R    Arg
PRR-6K    Lys
PRR-6A    Ala
PRR-6N    Asn
PRR-6Q    Gln
PRR-6D    Asp
Fig. 33: Plasmid construction scheme starting from pG117ompPRR. Labels: primers P10 (SEQ ID NO: 13) and P34 to P38 (SEQ ID NOs: 89 to 93); PCR with primers P10, P34, P35, P36, P37 and P38; 0.3 kbp (Fragment 43); PCR with primer P3 and fragment 43; NsiI-KpnI digestion; ligation. Legend: β-gal117S4H, linker peptide, GLP-1[G].
Fig. 34: Plasmid construction scheme starting from pG117ompPRR. Labels: primers P10 (SEQ ID NO: 13) and P39 to P43 (SEQ ID NOs: 94 to 98); PCR with primers P10, P39, P40, P41, P42 and P43; NsiI-KpnI digestion; 0.1 kbp (Fragment 46); ligation. Legend: β-gal117S4H, linker peptide.
Fig. 35: Gel image. Surviving lane labels: PRR, -4K, -4A, -4N, -4Q, -4D, -4E.
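Fig. 36: Structures of the fusion proteins RShANP and RShANPR (β-gal97S, linker peptide, α-hANP). Sequences from the -10-position, with OmpT protease cleaving between Arg(-1) and Ser(+1):

RShANP    Gln Met His Gly Tyr Asp Ala Gln Phe Arg Ser Leu Arg Arg
RShANPR   Gln Met His Gly Arg Asp Arg Gln Phe Arg Ser Leu Arg Arg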
Fig. 37: Plasmid construction scheme starting from pGRShANP. Labels: primers P10 (SEQ ID NO: 13) and P44 (SEQ ID NO: 100); PCR with primers P10 and P44; Fragment 48; primer P3 (SEQ ID NO: 6); PCR with fragment 48 and primer P3; BglII-EcoRI digestion; ligation. Legend: β-gal97S, linker peptide, α-hANP, Tcr.